Abstract. We propose a way of finding a Stein type characterization of a given
absolutely continuous distribution $\mu$ on $\mathbb{R}$ which is motivated by a regression
property satisfied by an exchangeable pair $(W, W')$ where $\mathcal{L}(W)$ is supposed or known
to be close to $\mu$. We also develop the exchangeable pairs approach within this
setting. This general procedure is then specialized to the class of Beta distributions
and, as an application, a convergence rate for the relative number of drawn red
balls among the first $n$ drawings from a Pólya urn is computed.



1. Introduction 

Since its introduction in [Ste72] in 1972, Stein's method has become a famous
and useful tool for proving distributional convergence. One of its main advantages
is that it usually yields a concrete error bound for various distributional
distances automatically. Although it was first developed only for normal approximation,
several authors observed that Stein's technique of characterizing operators is by no
means restricted to the normal distribution (see, e.g., [Che75] for the Poisson
distribution, [Ste86] and [SDHR04] for absolutely continuous distributions on $\mathbb{R}$). Highly
recommendable accounts of Stein's method are the books [CGS11] for the normal
approximation and [BC05] for a general introduction into the method.
In this paper we consider an absolutely continuous distribution $\mu$ on $(\mathbb{R}, \mathcal{B})$ with
density function $p$ and corresponding distribution function $F$. Let $-\infty \le a < b \le \infty$ be
extended real numbers such that $\operatorname{supp}(\mu) \subseteq \overline{(a, b)}$ and such that for each choice of
real numbers $a' > a$ and $b' < b$ we have $\operatorname{supp}(\mu) \not\subseteq \overline{(a', b)}$ and $\operatorname{supp}(\mu) \not\subseteq \overline{(a, b')}$,
i.e. if $a$ is real, then it is the left endpoint of the support of $\mu$, and if $b$ is real, it is the
right endpoint of $\operatorname{supp}(\mu)$. Here and in what follows, the closing operation is with
respect to the usual topology on $\mathbb{R}$. If the density $p$ is positive on $(a, b)$ and absolutely
continuous on every compact subinterval of $(a, b)$, then the so called density approach
within Stein's method gives a Stein identity and even a Stein characterization for $\mu$
(see [SDHR04], [CS11] or [CGS11]). In short, a real valued random variable $X$ has
distribution $\mu$ if and only if for each function $f$ in a suitable class $\mathcal{F}$ we have

(1) $E[f'(X) + \psi(X) f(X)] = 0$,

where $\psi(x) := \frac{d}{dx} \log p(x) = \frac{p'(x)}{p(x)}$ is the logarithmic derivative of $p$. For a given
Borel-measurable test function $h$ with $\int_{\mathbb{R}} |h(x)| \, d\mu(x) < \infty$ this characterization motivates
the so called Stein's equation

(2) $f'(x) + \psi(x) f(x) = h(x) - \mu(h)$,

to be solved for $f$, where we write $\mu(h)$ for $\int_{\mathbb{R}} h \, d\mu$. It turns out that a solution $f_h$
to (2) on $(a, b)$ is given by

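(3) $f_h(x) := \frac{1}{p(x)} \int_a^x \big( h(y) - \mu(h) \big) p(y) \, dy = -\frac{1}{p(x)} \int_x^b \big( h(y) - \mu(h) \big) p(y) \, dy$

and that, if $f_h$ is bounded and $\frac{1}{p}$ is unbounded on $(a, b)$, then $f_h$ is the only bounded
solution on $(a, b)$. For general properties of the solutions $f_h$ see [CS11] or [CGS11].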
Note that for general Borel-measurable $h$ it cannot be expected that there exists a
solution $f$ which is differentiable on all of $(a, b)$ and satisfies (2) pointwise. Thus, a
solution is understood to be an almost everywhere differentiable and Borel-measurable
function which satisfies (2) at all points $x \in (a, b)$ where it is in fact differentiable.
Contrary to the usual convention, at the remaining points $x \in (a, b)$ one defines
$f'(x) := -\psi(x) f(x) + h(x) - \mu(h)$. This yields a Borel-measurable function $f'$ on $(a, b)$
such that (2) holds for each $x \in (a, b)$.
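The solution formula can be checked numerically in a simple case. The following sketch is illustrative only and not part of the original argument: it takes $\mu = N(0, 1)$, so that $\psi(x) = -x$, and the test function $h(x) = x^2$ with $\mu(h) = 1$, for which $f_h(x) = \frac{1}{p(x)} \int_{-\infty}^x (h(y) - \mu(h)) p(y) \, dy$ equals $-x$ in closed form. The helper names `simpson`, `stein_solution` and `residual`, the truncation of the integral at $-12$ and the step sizes are assumptions of this sketch.

```python
import math


def phi(x):
    """Standard normal density p(x)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def simpson(func, lo, hi, n=2000):
    """Composite Simpson rule on [lo, hi] with an even number n of subintervals."""
    if n % 2:
        n += 1
    step = (hi - lo) / n
    total = func(lo) + func(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(lo + i * step)
    return total * step / 3.0


def stein_solution(h, mu_h, x, lower=-12.0):
    """f_h(x) = (1/p(x)) * int_{-inf}^x (h(y) - mu(h)) p(y) dy, truncated at `lower`."""
    integral = simpson(lambda y: (h(y) - mu_h) * phi(y), lower, x)
    return integral / phi(x)


def h(x):
    """Test function h(x) = x^2, for which mu(h) = E[Z^2] = 1."""
    return x * x


def residual(x, eps=1e-4):
    """f'(x) + psi(x) f(x) - (h(x) - mu(h)) with psi(x) = -x; should be close to 0."""
    f_plus = stein_solution(h, 1.0, x + eps)
    f_minus = stein_solution(h, 1.0, x - eps)
    derivative = (f_plus - f_minus) / (2.0 * eps)   # central difference
    return derivative - x * stein_solution(h, 1.0, x) - (h(x) - 1.0)
```

Evaluating `stein_solution(h, 1.0, x)` at a few points should reproduce $-x$ up to quadrature error, and `residual(x)` should be close to zero there.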

In order to understand the exchangeable pairs technique in the framework of the
density approach it might be helpful to recall the exchangeable pairs method in
the situation of normal approximation. This method, which was first presented in
Stein's monograph [Ste86], is a cornerstone of Stein's method of normal approximation
and is still the most frequently used coupling. This is due to the wide
applicability of standard couplings like the Gibbs sampler or making one time step
in a reversible Markov chain, which generally yield exchangeable pairs. By definition,
an exchangeable pair is a pair $(W, W')$ of random variables, defined on a common
probability space, such that their joint distribution is symmetric, i.e. such that
$(W, W') \stackrel{\mathcal{D}}{=} (W', W)$. In [Ste86], in order to show that a given real-valued random
variable $W$ is approximately standard normally distributed, Stein proposes the
construction of another random variable $W'$, a small random perturbation of $W$, on the
same space as $W$ such that $(W, W')$ forms an exchangeable pair and additionally the
following linear regression property holds:

(4) $E[W' - W \mid W] = -\lambda W$

Here, $\lambda \in (0, 1)$ is a constant which is typically close to zero for conveniently
chosen $W'$. If this condition is satisfied, then the distributional distance of $\mathcal{L}(W)$ to
$N(0, 1)$ can be efficiently bounded in various metrics, including the Kolmogorov and
Wasserstein metrics (see, e.g., [Ste86], [CS05] or [CGS11] for the common "plug-in
theorems").
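A standard toy example of the linear regression property, not taken from the source, may help fix ideas. Let $X_1, \dots, X_n$ be independent fair signs, $S = X_1 + \dots + X_n$ and $W = S / \sqrt{n}$, and let $W'$ be obtained by replacing one uniformly chosen coordinate with an independent fair sign. Then $(W, W')$ is exchangeable and $E[W' - W \mid W] = -\frac{1}{n} W$, i.e. $\lambda = \frac{1}{n}$. The sketch below verifies this on the level of $S$ by exact enumeration; since the conditional drift depends on the configuration only through $S$, conditioning on $W$ gives the same value. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def conditional_drift(config):
    """E[S' - S | X = config], where S' resamples one uniformly chosen coordinate.

    The chosen coordinate is replaced by an independent fair +/-1 sign.
    """
    n = len(config)
    drift = Fraction(0)
    for i in range(n):                  # index chosen uniformly from {0, ..., n-1}
        for new_sign in (-1, 1):        # independent replacement, probability 1/2 each
            drift += Fraction(new_sign - config[i], 2 * n)
    return drift


def check_linear_regression(n):
    """Return True if E[S' - S | X] = -(1/n) * S for every sign configuration."""
    lam = Fraction(1, n)
    return all(
        conditional_drift(config) == -lam * sum(config)
        for config in product((-1, 1), repeat=n)
    )
```

Exact rational arithmetic is used so that the identity is checked without rounding error. Dividing both sides by $\sqrt{n}$ transfers the statement from $S$ to $W$.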

The range of examples to which this technique could be applied was considerably
extended by the work [RR97] of Rinott and Rotar, who proved normal approximation
theorems allowing the linear regression property to be satisfied only approximately.
Specifically, they assumed the existence of a random quantity $R$, which is dominated
by $\lambda W$ in size, such that

(5) $E[W' - W \mid W] = -\lambda W + R$.

Note that necessarily $R$ is $\sigma(W)$-measurable and that, unlike condition (4),
condition (5) is not a true condition on the pair $(W, W')$ since we can always define
$R := E[W' - W \mid W] + \lambda W$ for each given constant $\lambda > 0$. However, the "plug-in
theorems" in [RR97], [SS06] or [CGS11] clarify that $R$ has to be of smaller order than
$\lambda$ in order to yield useful bounds. Since $W$ is supposed to have a "true" distributional
limit, it follows that both $\lambda$ and $R$ are at least asymptotically unique (see also the
introduction of [RR09] for the discussion of this topic).

When dealing with our possibly non-normal distribution $\mu$, the question is what
condition to substitute for the linear regression property (4) or (5). This question
was successfully answered independently by Eichelsbacher and Löwe in [EL10] and by
Chatterjee and Shao in [CS11]. They pointed out that in this more general setting
the appropriate regression property is

(6) $E[W' - W \mid W] = \lambda \psi(W) + R$,

where, again, $\lambda > 0$ is constant and $R$ is of smaller order than $\lambda \psi(W)$. In order to
give a flavour of the resulting "plug-in theorems", we present parts of Theorem 2.4
from [EL10].

Theorem 1.1. Let the density $p$ be positive and absolutely continuous on $(a, b)$. Suppose
that there exist positive real constants $c_1$, $c_2$ and $c_3$ such that for any Lipschitz-
continuous function $h : (a, b) \to \mathbb{R}$ with minimal Lipschitz constant $\|h'\|_\infty$, the
solution $f_h$ given by (3) satisfies

$\|f_h\|_\infty \le c_1 \|h'\|_\infty$, $\|f_h'\|_\infty \le c_2 \|h'\|_\infty$ and $\|f_h''\|_\infty \le c_3 \|h'\|_\infty$,

where, for a function $f : (a, b) \to \mathbb{R}$, we let $\|f\|_\infty$ be its essential supremum norm.
Then, for any exchangeable pair $(W, W')$ satisfying condition (6) we have

(7) $|E[h(W)] - \mu(h)| \le \|h'\|_\infty \left( c_2 E\left| 1 - \frac{1}{2\lambda} E[(W - W')^2 \mid W] \right| + \frac{c_3}{4\lambda} E[|W' - W|^3] + \frac{c_1}{\lambda} E|R| \right)$.



The third term of the bound (7) reveals that, in fact, $R$ must be of smaller order
than $\lambda$ in order for the bound to be useful, and from the second term we conclude
that $\lambda$ should be such that $E[(W - W')^2] \approx 2\lambda$. The first term appearing in the bound
on the right hand side of (7) is usually interpreted as saying that the random variable
$E[(W - W')^2 \mid W] / (2\lambda)$ must "obey a law of large numbers" to obtain decreasing
bounds. Bounding this term is often decisive for the success of applying Theorem 1.1.



Having discussed the method of exchangeable pairs within the density approach,
we now address the problem that condition (6) with negligible remainder term $R$ is in
some examples not satisfied by an exchangeable pair which, however, appears natural
to us for our approximation problem. For example, in many situations where the
exchangeable pair $(W, W')$ is constructed via the Gibbs sampler, we have a regression
property of the form

(8) $E[W' - W \mid W] = \lambda \big( -c (W - E[Z]) \big) + R$,

where $\lambda, c > 0$ are constants (the reason why $\lambda$ and $c$ are not subsumed into a single
constant will become clear later on) and where, again, $R$ is a negligible remainder.
Here, again, $Z \sim \mu$. Following the theory of the paper [CS11], condition (8) suggests
approximating $W$ with a normal distribution with mean $E[Z]$ and variance $\frac{1}{c}$. But
there are situations where the exchangeable pair $(W, W')$ is good, meaning that the
difference $|W' - W|$ is "small", condition (8) is satisfied and where we know that
$W$ is approximately distributed as a non-normal random variable $Z \sim \mu$, so that the
normal approximation is inappropriate. In general, this either means that the above
discussed law of large numbers cannot hold or that the resulting error term $R$ in (6)
is not negligible. These observations motivate a new version of Stein's method that
allows for a more general regression property.

Suppose that an appropriately chosen exchangeable pair $(W, W')$ satisfies the
following general regression property:

(9) $E[W' - W \mid W] = \lambda \gamma(W) + R$,

where $\lambda > 0$ is constant, $\gamma$ is a measurable function whose domain contains $(a, b)$
and which will be discussed further below, and where $R$ is a negligible remainder term. We
will see that it will be advantageous if the term $\gamma(x) \cdot g(x)$ appears in the "new"
Stein equation. So we make the following ansatz for the Stein identity:

(10) $E[\eta(Z) g'(Z) + \gamma(Z) g(Z)] = 0$,

where $\eta$ is another function which still has to be found.

Starting from the Stein identity (10), our aim is to identify the function $\eta$. If this
approach is successful, the Stein equation corresponding to a measurable function $h$
will be

(11) $\eta(x) g'(x) + \gamma(x) g(x) = h(x) - \mu(h)$.

Let $f_h$ be the solution to the equation (2). For the solution $g_h$ of (11) we make
the ansatz $g_h(x) = \alpha(x) f_h(x)$ for some (sufficiently smooth) function $\alpha$. We will
always let $\tilde{h} := h - \mu(h)$ and obtain

$\eta(x) g_h'(x) = \eta(x) \big( \alpha'(x) f_h(x) + \alpha(x) f_h'(x) \big)$
$= \eta(x) \alpha'(x) f_h(x) + \eta(x) \alpha(x) \big( \tilde{h}(x) - \psi(x) f_h(x) \big)$
(12) $= \eta(x) \alpha'(x) f_h(x) + \eta(x) \alpha(x) \tilde{h}(x) - \eta(x) \psi(x) g_h(x)$
$= \tilde{h}(x) - \gamma(x) g_h(x)$.

For this identity to hold, irrespective of the test function $h$, it must be the case
that $\alpha(x) = \frac{1}{\eta(x)}$ (in particular, $\eta$ must be differentiable at least almost everywhere)
and hence

(13) $\alpha'(x) = -\frac{\eta'(x)}{\eta(x)^2} = -\frac{\eta'(x)}{\eta(x)} \alpha(x)$.

Plugging this into (12) we obtain

$\eta(x) g_h'(x) = -\eta'(x) g_h(x) - \eta(x) \psi(x) g_h(x) + \tilde{h}(x)$.

This equals $\tilde{h}(x) - \gamma(x) g_h(x)$ if and only if $\eta$ satisfies the ordinary differential
equation

(14) $\eta'(x) = \gamma(x) - \psi(x) \eta(x)$.


This is a first order linear differential equation, which can of course be solved
explicitly by the method of variation of the constant. It turns out that the right
solution is given by

(15) $\eta(x) = \frac{1}{p(x)} \int_a^x \gamma(t) p(t) \, dt$,

at least if $\int_a^b |\gamma(t)| p(t) \, dt = E[|\gamma(Z)|] < \infty$. But this is a very natural condition
to hold, since the function $\gamma$ was motivated by the regression property (9) and so, if
the random variables $W$ and $W'$ are integrable, we obtain that both sides of (9) must
be $P$-integrable and, in fact,

$E[\lambda \gamma(W) + R] = E\big[ E[W' - W \mid W] \big] = E[W'] - E[W] = 0$.

Neglecting the remainder term $R$ we thus see that $E[\gamma(W)]$ should exist and, in
fact, be close to zero. So since $W \stackrel{\mathcal{D}}{\approx} Z$ we find it reasonable that $E[\gamma(Z)]$ exists and
even equals zero. Furthermore, it is a matter of routine to check that $\eta$ as given
in (15) indeed still satisfies (14). The above calculations starting with (10) were
rather formal but crucial for the motivation and understanding of our approach.

The paper is organized in the following way. Rigorous results and the abstract theory for
general $\mu$ are presented in Section 2. These results are then further specialized to
the Beta distributions in Section 3. In Section 4 the theory combined with a suitable
exchangeable pairs coupling is used to prove a rate of convergence in a
Pólya urn model (see Theorem 4.3). In Section 5 some rather lengthy or technical
proofs can be found and a sufficiently general version of de l'Hôpital's rule for merely
absolutely continuous functions is provided. This result justifies all the invocations
of this famous rule in the present work.

A few days after this work was on the arXiv, G. Reinert and L. Goldstein posted a
preprint (see [GR12]) which also develops Stein's method for the Beta distributions
and uses a comparison technique to prove error bounds in the Wasserstein
distance for the more special Pólya urn model, where the drawn ball is replaced
to the urn together with only one extra ball of the same colour.



Acknowledgements 



2. The general theory 



In this section we state the main results concerning bounds on the solution of
the Stein equation (11) and its first two derivatives and present a general "plug-in
theorem" within a general exchangeable pairs approach.

In order to derive precise results, we have to impose conditions on the density $p$ of
$\mu$ and on the coefficient $\gamma$ from (11), which was motivated by (9). In applications
it might be the case that a sequence of random variables of unbounded support
converges in distribution to a random variable $Z$ with bounded support. Usually
Stein's method for the distribution $\mu$ of $Z$ cannot handle such approximation problems,
since the Stein equation and its solution are only defined on $(a, b)$ or $\overline{(a, b)}$.
However, in many practical cases, the coefficients in Stein's equation are given by
certain "analytical expressions", which also make sense on $\mathbb{R} \setminus (a, b)$, and hence
the Stein solution can also be defined there. In order to cover these situations, too, we
will have to assume a number of conditions in this section. Those readers who are
only interested in approximation problems for random sequences whose support is
contained in $\overline{(a, b)}$ may restrict their attention to the relevant results. For reasons of
readability most proofs are deferred to the appendix. Only some short ones and the
proof of Proposition 2.31 are given in this section.

In the following we always let $Z$ be a real-valued random variable with distribution $\mu$.
We will always assume the following conditions on the density $p$ and the coefficient
$\gamma$:

CONDITION 2.1. The density $p$ is positive on the interval $(a, b)$ and absolutely
continuous on every compact interval $[c, d] \subseteq (a, b)$.

CONDITION 2.2. The function $\gamma : (a, b) \to \mathbb{R}$ is such that

(i) $\gamma$ is continuous on $(a, b)$.

(ii) $\gamma$ is strictly decreasing on $(a, b)$.

(iii) $\int_a^b |\gamma(t)| p(t) \, dt < \infty$ and in fact $E[\gamma(Z)] = \int_a^b \gamma(t) p(t) \, dt = 0$.

(iv) There is a unique $x_0 \in (a, b)$ with $\gamma(x_0) = 0$.

Remark 2.3. Note that in Condition 2.2, (iv) is actually implied by (i), (ii) and (iii)
and the intermediate value theorem. Furthermore, (iv) implies that $\gamma$ is positive on
$(a, x_0)$ and negative on $(x_0, b)$.

By item (iii) in Condition 2.2 we can define the function $I : (a, b) \to \mathbb{R}$ by
$I(x) := \int_a^x \gamma(t) p(t) \, dt$, which is continuously differentiable on $(a, b)$ if $\gamma$ satisfies Condition 2.2.

Proposition 2.4. Under Conditions 2.1 and 2.2 the function $I$ has the following
properties:

(a) $I(x) = -\int_x^b \gamma(t) p(t) \, dt$.

(b) $I(x) > 0$ for each $x \in (a, b)$.

(c) $I$ is strictly increasing on $(a, x_0)$ and strictly decreasing on $(x_0, b)$ and hence
attains its global maximum at $x_0$.

Proof. Of course, (a) is immediately implied by item (iii) from Condition 2.2.

To prove (b) and (c) first observe that by (iii) we have $\lim_{x \searrow a} I(x) = 0 = \lim_{x \nearrow b} I(x)$.
Furthermore, $I'(x) = \gamma(x) p(x)$ is positive on $(a, x_0)$ and negative on $(x_0, b)$, implying
the results. □

We begin developing Stein's method for the distribution $\mu$ satisfying Condition
2.1 and the coefficient $\gamma$ of the unknown function $g$ in (11), which we assume to
fulfill Condition 2.2. We define the function $\eta : (a, b) \to \mathbb{R}$ by (15) and for a given
Borel-measurable test function $h$ with $E[|h(Z)|] < \infty$ we consider the Stein equation
(11). In most cases of practical interest we will have $\lim_{x \searrow a} \eta(x) = 0$ if $a > -\infty$ so
that we can define $\eta(a) = 0$ to obtain a function which is continuous at $a$. The same
remark holds for $b$ (see also Proposition 2.5 down below and the discussion following
it). Note that we can also write $\eta(x) = \frac{I(x)}{p(x)}$ and so we can infer several properties of
$\eta$ from those of the function $I$.
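For a concrete feel for the function $\eta(x) = \frac{I(x)}{p(x)}$ with $I(x) = \int_a^x \gamma(t) p(t) \, dt$, the following sketch evaluates it numerically for a Beta density on $(0, 1)$. The parameter values, the linear coefficient $\gamma(x) = (\alpha + \beta)(E[Z] - x) = \alpha - (\alpha + \beta) x$ and all function names are assumptions made here for illustration. With this choice one can check by differentiating $x^\alpha (1 - x)^\beta$ that $\eta(x) = x (1 - x)$, which the code reproduces up to quadrature error.

```python
import math

ALPHA, BETA = 2.0, 3.0   # illustrative Beta parameters (an assumption of this sketch)


def beta_density(x):
    """Density p(x) of the Beta(ALPHA, BETA) distribution on (0, 1)."""
    norm = math.gamma(ALPHA + BETA) / (math.gamma(ALPHA) * math.gamma(BETA))
    return norm * x ** (ALPHA - 1.0) * (1.0 - x) ** (BETA - 1.0)


def gamma_coeff(x):
    """Assumed coefficient gamma(x) = (ALPHA + BETA) * (E[Z] - x) = ALPHA - (ALPHA + BETA) x."""
    return ALPHA - (ALPHA + BETA) * x


def simpson(func, lo, hi, n=2000):
    """Composite Simpson rule on [lo, hi] with an even number n of subintervals."""
    if n % 2:
        n += 1
    step = (hi - lo) / n
    total = func(lo) + func(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(lo + i * step)
    return total * step / 3.0


def eta(x):
    """eta(x) = I(x) / p(x) with I(x) = int_0^x gamma(t) p(t) dt."""
    integral = simpson(lambda t: gamma_coeff(t) * beta_density(t), 0.0, x)
    return integral / beta_density(x)
```

In this example $\eta$ vanishes at both end points $0$ and $1$, in line with the remark above on $\lim_{x \searrow a} \eta(x) = 0$.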

Proposition 2.5. Under Conditions 2.1 and 2.2 the function $\eta$ has the following
properties:

(a) $\eta$ is positive on $(a, b)$, absolutely continuous on every compact subinterval
$[c, d] \subseteq (a, b)$ and $\eta'(x) = \gamma(x) - \psi(x) \eta(x)$ for Lebesgue-almost all $x \in (a, b)$.

(b) $\lim_{x \searrow a} \eta(x) p(x) = \lim_{x \nearrow b} \eta(x) p(x) = 0$.

(c) If $\lim_{x \searrow a} p(x) = 0$, then $\lim_{x \searrow a} \eta(x) = \lim_{x \searrow a} \frac{\gamma(x)}{\psi(x)}$, if this limit exists.

(d) If $\liminf_{x \searrow a} p(x) \in (0, \infty) \cup \{\infty\}$, then $\lim_{x \searrow a} \eta(x) = 0$.

(e) If $\lim_{x \nearrow b} p(x) = 0$, then $\lim_{x \nearrow b} \eta(x) = \lim_{x \nearrow b} \frac{\gamma(x)}{\psi(x)}$, if this limit exists.

(f) If $\liminf_{x \nearrow b} p(x) \in (0, \infty) \cup \{\infty\}$, then $\lim_{x \nearrow b} \eta(x) = 0$.

Proof. The first part of (a) follows from the fact that $I$ is positive on $(a, b)$ and
continuously differentiable on $(a, b)$ and hence absolutely continuous on $[c, d]$, and that
$p$ is also absolutely continuous and bounded below by a positive constant on $[c, d]$. The rest of (a) has
already been observed. Items (b), (d) and (f) follow immediately from the properties
of the function $I$ in Proposition 2.4. To prove (c), we use de l'Hôpital's rule (see
Theorem 5.1) to derive

$\lim_{x \searrow a} \eta(x) = \lim_{x \searrow a} \frac{I(x)}{p(x)} = \lim_{x \searrow a} \frac{\gamma(x) p(x)}{p'(x)} = \lim_{x \searrow a} \frac{\gamma(x)}{\psi(x)}$.

In a similar way one can prove (e). □

The following "Mill's ratio" condition on the density $p$ and the corresponding
distribution function $F$ is often satisfied and will yield $\lim_{x \searrow a} \eta(x) = \lim_{x \nearrow b} \eta(x) = 0$.

CONDITION 2.6. The density $p$ of $\mu$ satisfies all the properties from Condition 2.1
and also the following:

(i) If $a > -\infty$, then $\lim_{x \searrow a} \frac{F(x)}{p(x)} = 0$.

(ii) If $b < \infty$, then $\lim_{x \nearrow b} \frac{1 - F(x)}{p(x)} = 0$.

Remark 2.7. (a) Condition 2.6 is always satisfied if the density $p$ is bounded away
from zero in suitable neighbourhoods of $a$ and $b$.

(b) Assume that both $a > -\infty$ and $b < \infty$ and that $\lim_{x \searrow a} p(x) = \lim_{x \nearrow b} p(x) = 0$.
Then Condition 2.6 is satisfied if there is a $\delta > 0$ such that $p$ is increasing on
$(a, a + \delta)$ and decreasing on $(b - \delta, b)$. This is easily seen by the inequality

$F(x) = \int_a^x p(t) \, dt \le p(x) (x - a)$,

valid for $x \in (a, a + \delta)$, and a similar one for the right end point $b$.

(c) Suppose that $a > -\infty$ and $b < \infty$, that $\lim_{x \searrow a} p(x) = \lim_{x \nearrow b} p(x) = 0$ and
that there is a $\delta > 0$ such that $p$ is convex on $(a, a + \delta)$ and on $(b - \delta, b)$. Then
the assumptions of (b) and hence Condition 2.6 are satisfied. In fact, first we can
extend $p$ to a continuous and convex function on $[a, b)$ by setting $p(a) := 0$. Now,
let $a < x < y < a + \delta$. Then there exists a $\lambda \in (0, 1)$ with $x = \lambda a + (1 - \lambda) y$
and by convexity we have:

$p(y) - p(x) = p(y) - p(\lambda a + (1 - \lambda) y) \ge p(y) - \lambda p(a) - (1 - \lambda) p(y) = \lambda p(y) > 0$.

Thus, $p$ is strictly increasing on $(a, a + \delta)$. Similarly, one shows that $p$ is
strictly decreasing on $(b - \delta, b)$ if $p$ is convex there.

(d) If $p$ is analytic at $a$ and $b$, then Condition 2.6 is also satisfied. Indeed, if there is
an $r > 0$ such that $p(x) = \sum_{k=0}^\infty c_k (x - a)^k$ for all $x \in (a, a + r)$, then the function
$f : (a - r, a + r) \to \mathbb{R}$ with $f(x) := \sum_{k=0}^\infty c_k (x - a)^k$ is well-defined. Let
$n_0 := \min\{k \ge 0 : c_k \ne 0\}$. Then $n_0 < \infty$ since $\operatorname{supp}(\mu) = \overline{(a, b)}$. If $n_0 = 0$ and hence
$f(a) = c_0 = \lim_{x \searrow a} p(x) \ne 0$, then there is nothing to show. Otherwise, we have
$p(x) = (x - a)^{n_0} \sum_{k=n_0}^\infty c_k (x - a)^{k - n_0}$ and $p'(x) = (x - a)^{n_0 - 1} \sum_{k=n_0}^\infty k c_k (x - a)^{k - n_0}$
for $x \in (a, a + r)$ and hence, by de l'Hôpital's rule,

$\lim_{x \searrow a} \frac{F(x)}{p(x)} = \lim_{x \searrow a} \frac{p(x)}{p'(x)} = \lim_{x \searrow a} (x - a) \frac{\sum_{k=n_0}^\infty c_k (x - a)^{k - n_0}}{\sum_{k=n_0}^\infty k c_k (x - a)^{k - n_0}} = \frac{c_{n_0}}{n_0 c_{n_0}} \lim_{x \searrow a} (x - a) = 0$.

□



The following proposition provides the announced result.

Proposition 2.8. Assume Condition 2.6. Then the function $\eta$ vanishes at the finite
end points of the support $\overline{(a, b)}$ of $\mu$, i.e. if $a > -\infty$, then $\lim_{x \searrow a} \eta(x) = 0$ and if
$b < \infty$, then $\lim_{x \nearrow b} \eta(x) = 0$. Hence, we may extend $\eta$ to a continuous function on
$\overline{(a, b)}$ by letting $\eta(a) := \eta(b) := 0$.

Proof. Suppose that $a > -\infty$. Then, by the positivity of $I$ and the monotonicity of
$\gamma$, for $a < x < b$:

$0 < I(x) = \int_a^x \gamma(t) p(t) \, dt \le \gamma(a) \int_a^x p(t) \, dt = \gamma(a) F(x)$.

Hence,

$0 \le \liminf_{x \searrow a} \eta(x) \le \limsup_{x \searrow a} \eta(x) = \limsup_{x \searrow a} \frac{I(x)}{p(x)} \le \gamma(a) \lim_{x \searrow a} \frac{F(x)}{p(x)} = 0$,

so that $\lim_{x \searrow a} \eta(x) = 0$. The proof of $\lim_{x \nearrow b} \eta(x) = 0$ for finite $b$ is similar by
using the representation $I(x) = -\int_x^b \gamma(t) p(t) \, dt$ and is therefore omitted. □

From our above heuristic calculations we know that the function $g_h : (a, b) \to \mathbb{R}$
with

$g_h(x) := \frac{1}{p(x) \eta(x)} \int_a^x \big( h(t) - \mu(h) \big) p(t) \, dt$
(16) $= -\frac{1}{p(x) \eta(x)} \int_x^b \big( h(t) - \mu(h) \big) p(t) \, dt$

solves the Stein equation (11) for $x \in (a, b)$. This can also be proved directly
by differentiation, and the formula for $g_h$ could also be derived by the method of
variation of the constant using the fact that $\log(p \cdot \eta)$ is a primitive function of $\frac{\gamma}{\eta}$,
which follows from (14). If we can show that $g_h$ is bounded, then it will immediately
follow from Proposition 2.5 (a) that $g_h$ is the only bounded solution of (11), since
the solutions of the corresponding homogeneous equation are constant multiples of
$\frac{1}{p \cdot \eta}$.

Since we do not exclude approximating random variables which take on the values $a$
or $b$, we show that the solution $g_h$ can be extended continuously to $a$ and $b$, if $h$ is
continuous there. By the properties of the function $I = p \eta$ on $(a, b)$ from Proposition
2.5, the continuity of $\gamma$ and by de l'Hôpital's rule (see Theorem 5.1) we have, with
$\tilde{h} = h - \mu(h)$,

(17) $\lim_{x \searrow a} g_h(x) = \lim_{x \searrow a} \frac{\int_a^x \tilde{h}(t) p(t) \, dt}{I(x)} = \lim_{x \searrow a} \frac{\tilde{h}(x) p(x)}{\gamma(x) p(x)} = \lim_{x \searrow a} \frac{\tilde{h}(x)}{\gamma(x)} = \frac{\tilde{h}(a)}{\gamma(a)}$

and, similarly, $\lim_{x \nearrow b} g_h(x) = \frac{\tilde{h}(b)}{\gamma(b)}$, if $h$ is continuous at $a$ and $b$. Thus, we can
conclude the following proposition.

Proposition 2.9. Assume Conditions 2.1 and 2.2 and let $h : \overline{(a, b)} \to \mathbb{R}$ be a
Borel-measurable test function with $E[|h(Z)|] < \infty$ which is continuous at $a$ and $b$. Then
the Stein solution $g_h$ as defined above may be extended to a continuous function on
$\overline{(a, b)}$ by letting

$g_h(a) := \frac{h(a) - \mu(h)}{\gamma(a)}$ and $g_h(b) := \frac{h(b) - \mu(h)}{\gamma(b)}$.

As is typical for Stein's method, its success within the applications considerably
depends on good bounds on the solutions $g_h$ and their derivative(s), generally
uniformly over some given class of test functions $h$.

The next step will be to prove such bounds. It has to be mentioned that we cannot
expect to derive concrete good bounds in full generality, but that sometimes further
conditions have to be imposed either on the distribution $\mu$ (e.g. through the density
$p$) or on the coefficient $\gamma$. Nevertheless, we will derive bounds involving functional
expressions which can a posteriori be simplified, computed or further bounded for
concrete distributions. So our abstract viewpoint will pay off. Moreover, some of our
bounds will actually hold in complete generality.

The next proposition contains a bound for the solutions $g_h$ for bounded and
Borel-measurable test functions $h$.

Proposition 2.10. Assume Conditions 2.1 and 2.2 and let $m$ be a median for $\mu$.
Then, for $h : (a, b) \to \mathbb{R}$ Borel-measurable and bounded we have

(18) $\|g_h\|_\infty \le \frac{\|h - \mu(h)\|_\infty}{2 I(m)} = \frac{\|h - \mu(h)\|_\infty}{2 \int_a^m \gamma(t) p(t) \, dt}$.

The proof is deferred to the appendix. The following corollary specializes this
result to the case that $\gamma(x) = -c (x - E[Z])$ and that $\mu$ is symmetric with respect to
its median, which is then equal to its expected value $E[Z]$, that is $Z - m \stackrel{\mathcal{D}}{=} m - Z$.

Corollary 2.11. In addition to Conditions 2.1 and 2.2 assume that the distribution
$\mu$ is symmetric with respect to $m = E[Z]$ and that $\gamma(x) = -c (x - E[Z])$ for some
positive constant $c$. Then for each bounded and Borel-measurable test function
$h : (a, b) \to \mathbb{R}$ we have

(19) $\|g_h\|_\infty \le \frac{\|h - \mu(h)\|_\infty}{c \, E|Z - m|}$.

Proof. In this case we clearly have $I(m) = \frac{c}{2} E[|Z - m|]$, which implies the result by
Proposition 2.10. □

In the case that $\mu = N(0, 1)$ and $c = 1$ this result specializes to the well known
bound $\sqrt{\pi / 2} \, \|h - \mu(h)\|_\infty$ (see [CGS11] or [CS05], e.g.).
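The normal special case can be confirmed numerically. The sketch below, which is illustrative and not part of the source, evaluates the constant $\frac{1}{2 I(m)}$ with $I(m) = \int_a^m \gamma(t) p(t) \, dt$ for $\mu = N(0, 1)$, $\gamma(x) = -x$ and $m = 0$, and compares it with $\sqrt{\pi / 2}$. The function names and the truncation of the lower limit at $-12$ are assumptions.

```python
import math


def phi(x):
    """Standard normal density."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)


def simpson(func, lo, hi, n=2000):
    """Composite Simpson rule on [lo, hi] with an even number n of subintervals."""
    if n % 2:
        n += 1
    step = (hi - lo) / n
    total = func(lo) + func(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(lo + i * step)
    return total * step / 3.0


def sup_norm_constant(gamma, density, lower, median):
    """Constant 1 / (2 I(m)) with I(m) = int_lower^m gamma(t) p(t) dt."""
    i_at_median = simpson(lambda t: gamma(t) * density(t), lower, median)
    return 1.0 / (2.0 * i_at_median)


# Standard normal case: gamma(x) = -x (so c = 1), median m = 0, lower limit truncated at -12.
NORMAL_CONSTANT = sup_norm_constant(lambda t: -t, phi, -12.0, 0.0)
```

Here $I(0) = \int_{-\infty}^0 (-t) \varphi(t) \, dt = \varphi(0) = \frac{1}{\sqrt{2\pi}}$, so the constant is $\frac{\sqrt{2\pi}}{2} = \sqrt{\pi / 2}$, in agreement with $\frac{1}{c \, E|Z|}$ for $E|Z| = \sqrt{2 / \pi}$.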



Remark 2.12. In the formulation of Proposition 2.10 it might be surprising that there
is no bound mentioned for $\|g_h'\|_\infty$. This is because, in general, a bound of the form
$\|g_h'\|_\infty \le C \|h\|_\infty$ does not exist with a finite constant $C$ in this setup. Note that this
is contrary to the density approach, where one generally has such a bound (see [CS11]
or [CGS11]).

Next, we will turn to Lipschitz continuous test functions $h$. In contrast to bounded
measurable test functions, there we will also be able to prove useful bounds for $\|g_h'\|_\infty$.

In order to obtain bounds for Lipschitz continuous test functions we need a further
condition on the distribution $\mu$ which guarantees that its expected value exists.

CONDITION 2.13. The density $p$ is positive on the interval $(a, b)$ and absolutely
continuous on every compact interval $[c, d] \subseteq (a, b)$. Furthermore, $E[|Z|] = \int_{\mathbb{R}} |x| p(x) \, dx < \infty$.

The following proposition, which is also proved in the appendix, includes bounds
for both $g_h$ and $g_h'$ when $h$ is Lipschitz.

Proposition 2.14. Under Conditions 2.13 and 2.2 we have for any Lipschitz
continuous test function $h : (a, b) \to \mathbb{R}$ and any $x \in (a, b)$:

(a) $|g_h(x)| \le \|h'\|_\infty \frac{F(x) E[Z] - \int_a^x y p(y) \, dy}{I(x)}$

(b) $|g_h'(x)| \le \|h'\|_\infty \frac{H(x) \int_x^b (1 - F(s)) \, ds + G(x) \int_a^x F(s) \, ds}{p(x) \eta(x)^2}$

Here, for $x \in (a, b)$ the positive functions $H(x)$ and $G(x)$ are defined by

$H(x) := I(x) - \gamma(x) F(x) = p(x) \eta(x) - \gamma(x) F(x)$ and $G(x) := H(x) + \gamma(x)$.

Remark 2.15. In general, the term $S(x) := \frac{F(x) E[Z] - \int_a^x y p(y) \, dy}{I(x)}$ cannot be bounded
uniformly in $x \in (a, b)$ unless $|\gamma|$ grows at least linearly in $x$.

Corollary 2.16. Assume Condition 2.13 and that $\gamma(x) = c (E[Z] - x)$ for some
$c > 0$. Then we have for any Lipschitz continuous test function $h : (a, b) \to \mathbb{R}$ and
each $x \in (a, b)$:

(a) $\|g_h\|_\infty \le \frac{\|h'\|_\infty}{c}$

(b) $|g_h'(x)| \le \|h'\|_\infty \frac{2 H(x) G(x)}{c \, p(x) \eta(x)^2} = 2 c \|h'\|_\infty \frac{\int_a^x F(s) \, ds \int_x^b (1 - F(t)) \, dt}{p(x) \eta(x)^2}$

Proof. Claim (a) follows from Proposition 2.14 (a) and the observation that in this
case we have

$I(x) = \int_a^x \gamma(y) p(y) \, dy = c \int_a^x (E[Z] - y) p(y) \, dy = c \left( E[Z] F(x) - \int_a^x y p(y) \, dy \right)$.

Part (b) follows from Proposition 2.14 (b) and Lemma 5.2 by observing that in
this case

$H(x) = I(x) - \gamma(x) F(x) = c \left( \int_a^x (E[Z] - t) p(t) \, dt - (E[Z] - x) F(x) \right)$
$= c \left( E[Z] F(x) - \int_a^x t p(t) \, dt - E[Z] F(x) + x F(x) \right)$
$= c \int_a^x F(s) \, ds$

and, similarly, $G(x) = c \int_x^b (1 - F(s)) \, ds$. □
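A simple worked example, added here as an illustration and not taken from the source, shows that the bound $\|g_h\|_\infty \le \frac{\|h'\|_\infty}{c}$ for $\gamma(x) = c (E[Z] - x)$ cannot be improved in general. Take the identity test function $h(x) = x$, which is Lipschitz with $\|h'\|_\infty = 1$ and satisfies $\mu(h) = E[Z]$. Writing the Stein solution as $g_h(x) = \frac{1}{I(x)} \int_a^x (h(t) - \mu(h)) p(t) \, dt$ with $I(x) = p(x) \eta(x) = \int_a^x \gamma(t) p(t) \, dt$, one finds:

```latex
\begin{align*}
h(x) - \mu(h) &= x - E[Z] = -\tfrac{1}{c}\,\gamma(x), \\
g_h(x) &= \frac{1}{I(x)} \int_a^x \bigl(h(t) - \mu(h)\bigr)\, p(t)\, dt
        = -\frac{1}{c}\,\frac{1}{I(x)} \int_a^x \gamma(t)\, p(t)\, dt
        = -\frac{1}{c}, \\
\eta(x)\, g_h'(x) + \gamma(x)\, g_h(x) &= 0 - \tfrac{1}{c}\,\gamma(x) = h(x) - \mu(h),
\qquad \|g_h\|_\infty = \tfrac{1}{c} = \tfrac{1}{c}\,\|h'\|_\infty .
\end{align*}
```

So the constant solution $g_h \equiv -\frac{1}{c}$ attains the bound with equality.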

Remark 2.17. (i) It is quite remarkable that in the case of normal approximation
(via its classical Stein equation) the bound given in Corollary 2.16 (a) even
improves on the best bound $2 \|h'\|_\infty$ currently mentioned in the literature (see,
e.g., [CGS11] or [CS05]). In fact, in this case $c = 1$ and thus our bound reduces
to $\|h'\|_\infty$.

(ii) For concrete distributions the ratio appearing in the bound for $g_h'(x)$ may be
bounded uniformly in $x$ by some constant which can sometimes also be computed
explicitly. Nevertheless, in [EV12] the authors give mild conditions for
the existence of a finite constant $k$ such that $\|g_h'\|_\infty \le k \|h'\|_\infty$ for any
Lipschitz-continuous $h$. In practice, these conditions are usually met. However, there is
no hope of estimating the constant $k$ by their method of proof. Thus, for concrete
distributions and explicit constants it might be useful to work
with our bound from Corollary 2.16 (b).

(iii) For the normal distribution and also for the larger class of distributions
discussed in [EL10], one also has a bound of the form $\|g_h''\|_\infty \le C \|h'\|_\infty$ for some
finite constant $C$ holding for each Lipschitz function $h$. As was shown by a
universal example in [EV12], such a bound cannot be expected unless $a = -\infty$
and $b = \infty$, if one takes $\gamma(x) = c (E[Z] - x)$. This is why we will have to assume
that $h'$ is also Lipschitz, for example by demanding that $h$ has two bounded
derivatives. For the Stein solutions of the density approach, however, there are
many examples of distributions whose support is strictly included in $\mathbb{R}$ but for
which such bounds are available (see, e.g., chapter 13 of [CGS11]).

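Next, we will discuss how we can express the density $p$ of $\mu$ in terms of $\gamma$ and $\eta$.
This will be useful to bound the second derivative of $g_h$ in some special cases. Let
$x_0$ be as in Condition 2.2. Since $\eta' = \gamma - \eta \psi$ and hence $\psi = \frac{\gamma - \eta'}{\eta}$ we have

(20) $p(x) = p(x_0) \exp\left( \int_{x_0}^x \psi(t) \, dt \right) = p(x_0) \exp\left( \int_{x_0}^x \frac{\gamma(t) - \eta'(t)}{\eta(t)} \, dt \right)$.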
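The integrand in this representation can be split, which gives an equivalent form that is sometimes easier to evaluate. The following short derivation is a sketch added for illustration. It uses only that $\eta$ is positive and absolutely continuous on compact subintervals of $(a, b)$, so that $\frac{\eta'}{\eta}$ integrates to $\log \eta$. The Beta coefficients in the last two lines, $\eta(x) = x (1 - x)$ and $\gamma(x) = \alpha - (\alpha + \beta) x$ on $(0, 1)$, are an assumed example.

```latex
\begin{align*}
\int_{x_0}^x \frac{\gamma(t) - \eta'(t)}{\eta(t)}\, dt
  &= \int_{x_0}^x \frac{\gamma(t)}{\eta(t)}\, dt - \log\frac{\eta(x)}{\eta(x_0)}, \\
p(x) &= \frac{p(x_0)\,\eta(x_0)}{\eta(x)}
        \exp\!\left( \int_{x_0}^x \frac{\gamma(t)}{\eta(t)}\, dt \right), \\
\text{Beta example:}\quad
\frac{\gamma(t)}{\eta(t)} &= \frac{\alpha}{t} - \frac{\beta}{1 - t}, \qquad
\exp\!\left( \int_{x_0}^x \frac{\gamma(t)}{\eta(t)}\, dt \right)
  = \frac{x^{\alpha} (1 - x)^{\beta}}{x_0^{\alpha} (1 - x_0)^{\beta}}, \\
p(x) &\propto \frac{x^{\alpha} (1 - x)^{\beta}}{x (1 - x)} = x^{\alpha - 1} (1 - x)^{\beta - 1}.
\end{align*}
```

In the assumed Beta example the formula thus recovers the Beta density up to its normalizing constant.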



Formula (20) is a more general version of formula (3.14) in [NV09] and is also
derived in [KT12]. Now, differentiating Stein's equation (11), we obtain for $h$ Lipschitz

(22) $\eta(x) g_h''(x) + g_h'(x) \big( \eta'(x) + \gamma(x) \big) = h'(x) - \gamma'(x) g_h(x) =: h_2(x)$.

This means that the function $g := g_h'$ is a solution of the Stein equation
corresponding to the test function $h_2$ for the distribution $\tilde{\mu}$ which satisfies the Stein
identity

$E\big[ \eta(Y) f'(Y) + \big( \eta'(Y) + \gamma(Y) \big) f(Y) \big] = 0$,

where $Y \sim \tilde{\mu}$. From (20) we know that a density $\tilde{p}$ of $\tilde{\mu}$ is given by

(23) $\tilde{p}(x) = \frac{\tilde{K}}{\eta(x)} \exp\left( \int_{x_0}^x \frac{\eta'(t) + \gamma(t)}{\eta(t)} \, dt \right) = K \exp\left( \int_{x_0}^x \frac{\gamma(t)}{\eta(t)} \, dt \right)$,

where $K, \tilde{K} > 0$ are suitable normalizing constants. Thus, if we have bounds
for the first derivative of the Stein solutions for the distribution $\tilde{\mu}$ and for Lipschitz
functions $h$, then we obtain from this observation bounds on $g_h''$ for $h$ such that $h_2$ is
Lipschitz (if $\gamma(x) = c (E[Z] - x)$, this essentially means that $h'$ must be Lipschitz).
Of course, it has to be verified that $\tilde{\mu}(h_2) = 0$ in order to use these bounds. But
this will follow immediately if one can show that $g$ belongs to a class of functions
for which the Stein identity for $\tilde{\mu}$ is valid.

In the following we are going to address the question of how to extend Stein's
equation and its solution to $\mathbb{R} \setminus (a, b)$. To this end, we suppose that the functions $\gamma$
and $\eta$ both have a natural extension to all of $\mathbb{R}$, for example that they are given by
some "analytical expression" on $(a, b)$ which still makes sense on $\mathbb{R} \setminus (a, b)$. We will
need the following condition.

CONDITION 2.18. The functions $\gamma$ and $\eta$ are defined as before on $(a, b)$ and may be
extended to $\mathbb{R}$ such that the following properties are satisfied:

(i) On $(a, b)$ the function $\gamma$ has all the properties listed in Condition 2.2 and is
continuous and strictly decreasing on $\mathbb{R}$.

(ii) The function $\eta$ is given by (15) for $x \in (a, b)$ and is absolutely continuous
on every compact sub-interval $[c, d] \subseteq \mathbb{R}$. Furthermore, $\eta(x) \ne 0$ for all
$x \in \mathbb{R} \setminus \{a, b\}$.

Let $h : \mathbb{R} \to \mathbb{R}$ be a given Borel-measurable test function with $E[|h(Z)|] < \infty$.
Then we have the following Stein equation, valid now for all $x \in \mathbb{R}$:

(24) $\eta(x) g'(x) + \gamma(x) g(x) = h(x) - \mu(h) =: \tilde{h}(x)$

We already know how to solve (24) for $x \in (a, b)$. So now, we will assume that at
least one of $a$ and $b$ is finite and try to solve the equation outside $(a, b)$. Furthermore,
we will discuss conditions that ensure that the composed solution $g_h$ behaves nicely
at the edges $a$ and/or $b$. We will henceforth assume Condition 2.18.
For $x \ne a, b$ equation (24) is clearly equivalent to

(25) $g'(x) = -\frac{\gamma(x)}{\eta(x)} g(x) + \frac{\tilde{h}(x)}{\eta(x)}$.

Let us assume that both $a > -\infty$ and $b < \infty$ (the other cases are of course
included) and let $F_l$ be any primitive function of $\gamma/\eta$ on $(-\infty, a)$. Such a function
exists by continuity and is hence continuously differentiable. By the method of
variation of the constant one may derive the following formula for $x \in (-\infty, a)$:

$$g_h(x) := \exp(-F_l(x)) \int_a^x \frac{\tilde h(t)}{\eta(t)}\,\exp(F_l(t))\,dt, \tag{26}$$

if this integral exists. Note that this property does not depend on the particular
choice of the primitive function $F_l$. For a fixed primitive function $F_l$ of $\gamma/\eta$ on $(-\infty, a)$
we define the function

$$q_l := \frac{\exp(F_l)}{\eta}.$$

Analogously, for a given primitive function $F_r$ of $\gamma/\eta$ on $(b, \infty)$ we define the function

$$q_r := \frac{\exp(F_r)}{\eta}.$$
Note that inside the interval $(a,b)$ we have that $\log(\eta p)$ is a primitive of $\gamma/\eta$ and
hence $q_l$ plays the role of $p$ on $(-\infty, a)$ (and similarly for $q_r$). As we have observed,
we will need the following condition:

CONDITION 2.19. There exist primitive functions $F_l : (-\infty, a) \to \mathbb{R}$ and $F_r :
(b, \infty) \to \mathbb{R}$ of $\gamma/\eta$ on $(-\infty, a)$ and on $(b, \infty)$, respectively, such that for each $x \in
(-\infty, a)$ the function $q_l$ is integrable over $[x, a)$ and for each $y \in (b, \infty)$ the function
$q_r$ is integrable over $(b, y]$. We may thus define functions $Q_l : (-\infty, a) \to \mathbb{R}$ by
$Q_l(x) := \int_a^x q_l(t)\,dt$ and $Q_r : (b, \infty) \to \mathbb{R}$ by $Q_r(y) := \int_b^y q_r(t)\,dt$.

Similarly, for $x \in (b, \infty)$ we arrive at the definition

$$g_h(x) := \exp(-F_r(x)) \int_b^x \frac{\tilde h(t)}{\eta(t)}\,\exp(F_r(t))\,dt = \exp(-F_r(x)) \int_b^x \tilde h(t)\,q_r(t)\,dt, \tag{27}$$

if this integral exists. So, in the following we will always implicitly assume that
also $\int_b^y |\tilde h(t)\,q_r(t)|\,dt < \infty$ and $\int_x^a |\tilde h(t)\,q_l(t)|\,dt < \infty$ both hold for each $x \in (-\infty, a)$
and each $y \in (b, \infty)$.

Remark 2.20. Note that the definition of the solution does not depend on the
choice of the primitive functions $F_l$ and $F_r$ since two such functions may only differ
by an additive constant.

Next, we prove that the above constructed solution is continuous as long as $h$
is continuous at $a$ and $b$. To deal with the limits $\lim_{x \nearrow a} g_h(x)$ and $\lim_{x \searrow b} g_h(x)$ we
first formulate a condition which will usually be satisfied in practice.

CONDITION 2.21. The functions $F_l$ and $F_r$ satisfy $\lim_{x \nearrow a} F_l(x) = \pm\infty$ and
$\lim_{x \searrow b} F_r(x) = \pm\infty$.

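Again, the validity of this condition does not depend on the choice of the functions
$F_l$ and $F_r$. By Condition 2.21 we may again apply de l'Hôpital's rule to compute

$$\begin{aligned}
\lim_{x \nearrow a} g_h(x) &= \lim_{x \nearrow a} \frac{\int_a^x \tilde h(t)\,q_l(t)\,dt}{\exp(F_l(x))} = \lim_{x \nearrow a} \frac{\tilde h(x)\,q_l(x)}{\exp(F_l(x))\,F_l'(x)} \\
&= \lim_{x \nearrow a} \frac{\tilde h(x)\,q_l(x)}{\gamma(x)\,q_l(x)} = \lim_{x \nearrow a} \frac{\tilde h(x)}{\gamma(x)} = \frac{1}{\gamma(a)} \lim_{x \nearrow a} \tilde h(x)
\end{aligned} \tag{28}$$

and, similarly, $\lim_{x \searrow b} g_h(x) = \frac{1}{\gamma(b)} \lim_{x \searrow b} \tilde h(x)$, if these limits exist. Again, the
continuity of $h$ at $a$ and $b$ is sufficient for this to hold. By Proposition 2.9 we can
thus conclude the following proposition.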
Proposition 2.22. Assume Conditions 2.18, 2.19 and 2.21 and let $h : \mathbb{R} \to \mathbb{R}$ be a
Borel-measurable test function satisfying the above integrability conditions and being
continuous at $a$ and $b$. Then the Stein solution as defined above may be extended
to a continuous function on $\mathbb{R}$ by letting

$$g_h(a) := \frac{h(a) - \mu(h)}{\gamma(a)} \quad\text{and}\quad g_h(b) := \frac{h(b) - \mu(h)}{\gamma(b)}.$$



Next, we want to present bounds on $g_h(x)$ and $g_h'(x)$ for $x \notin (a,b)$. But first we
will show that our conditions already imply that $\eta(x) < 0$ if $x < a$ or $x > b$. Since
$F_l'(x) = \gamma(x)/\eta(x)$ is then negative on $(-\infty, a)$, this also ensures that $\lim_{x \nearrow a} F_l(x) = -\infty$.
Similarly, $\lim_{x \searrow b} F_r(x) = -\infty$.

Proposition 2.23. Assume Conditions 2.18, 2.19 and 2.21. Then the functions $\eta$,
$F_l$ and $F_r$ have the following properties:

(a) For all $x \in \mathbb{R} \setminus [a,b]$ we have $\eta(x) < 0$. In particular, by continuity we have
$\eta(a) = \eta(b) = 0$.

(b) We have $\lim_{x \nearrow a} F_l(x) = \lim_{x \searrow b} F_r(x) = -\infty$.

Proof. To prove (a), first note that by Condition 2.18 $\eta$ has no sign changes on
$(-\infty, a)$. Suppose, contrary to the assertion, that $\eta(x) > 0$ for all $x \in (-\infty, a)$.
Then the function $q_l$ is also positive and hence $0 < \int_x^a q_l(t)\,dt = -Q_l(x)$ for each
$x \in (-\infty, a)$. By Conditions 2.19 and 2.21 we may apply de l'Hôpital's rule to
conclude

$$0 \le \lim_{x \nearrow a} \frac{-Q_l(x)}{\exp(F_l(x))} = \lim_{x \nearrow a} \frac{-q_l(x)}{\frac{\gamma(x)}{\eta(x)}\exp(F_l(x))} = \lim_{x \nearrow a} \frac{-q_l(x)}{\gamma(x)\,q_l(x)} = -\lim_{x \nearrow a} \frac{1}{\gamma(x)} = \frac{-1}{\gamma(a)} < 0,$$

by Condition 2.18. This is a contradiction and hence we must have $\eta(x) < 0$ for
$x \in (-\infty, a)$. Similarly, one shows that also $\eta(x) < 0$ for $x \in (b, \infty)$.
To prove (b), note that $F_l'(x) = \gamma(x)/\eta(x) < 0$ for $x < a$. By Condition 2.21 this necessarily
implies that $\lim_{x \nearrow a} F_l(x) = -\infty$. Analogously, we have $F_r'(x) = \gamma(x)/\eta(x) > 0$ for $x > b$ (since
$\gamma(x) < 0$ there), which by Condition 2.21 implies that $\lim_{x \searrow b} F_r(x) = -\infty$. □

Proposition 2.23 in particular implies the conclusion of Proposition 2.8 and hence
makes Condition 2.6 redundant, at least as far as the assertion of this proposition
is concerned. In order to get general bounds, we will need yet another condition on
the functions $F_l$ and $F_r$.

CONDITION 2.24. The functions $F_l$ and $F_r$ satisfy $\lim_{x \searrow -\infty} F_l(x) = \lim_{x \nearrow \infty} F_r(x) =
+\infty$.

The next result gives bounds on $g_h$ for bounded, Borel-measurable functions $h$. As
usual, the proof is in the appendix.

Proposition 2.25. Assume Conditions 2.18, 2.19, 2.21 and 2.24 and let $m$ be a
median for $\mu$. Then, for any bounded and Borel-measurable test function $h : \mathbb{R} \to \mathbb{R}$
we have

$$\|g_h\|_\infty \le \|h - \mu(h)\|_\infty \max\left( \frac{1}{2I(m)},\ \frac{1}{\gamma(a)},\ \frac{-1}{\gamma(b)} \right).$$



Before turning to Lipschitz test functions, we discuss properties of the functions
$\exp(F_l)$ and $\exp(F_r)$, respectively. In particular, we will show that they correspond
to the function $I$ on $(a,b)$ and have similar integral representations.

Proposition 2.26. We define the functions $I_l : (-\infty, a) \to \mathbb{R}$ and
$I_r : (b, \infty) \to \mathbb{R}$ by $I_l(x) := \exp(F_l(x))$ and $I_r(x) := \exp(F_r(x))$. Then the following
representations hold:

(a) For each $x \in (-\infty, a)$ we have $I_l(x) = \int_a^x \gamma(t)\,q_l(t)\,dt$.

(b) For each $x \in (b, \infty)$ we have $I_r(x) = \int_b^x \gamma(t)\,q_r(t)\,dt$.

In particular, these integrals exist.

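Proof. We only prove (a). Let $x \in (-\infty, a)$ and let $(a_n)_{n \in \mathbb{N}}$ be any sequence in
$(-\infty, a)$ with $\lim_{n \to \infty} a_n = a$. Then, for each $n \in \mathbb{N}$,

$$\begin{aligned}
\int_{a_n}^x \gamma(t)\,q_l(t)\,dt &= \int_{a_n}^x \frac{\gamma(t)}{\eta(t)}\exp(F_l(t))\,dt = \int_{a_n}^x F_l'(t)\exp(F_l(t))\,dt \\
&= \int_{F_l(a_n)}^{F_l(x)} e^s\,ds = \exp(F_l(x)) - \exp(F_l(a_n)) \xrightarrow{n \to \infty} \exp(F_l(x)),
\end{aligned}$$

by Proposition 2.23. Hence $\int_a^x \gamma(t)\,q_l(t)\,dt$ exists and equals $I_l(x)$. □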
Proposition 2.27. Assume Conditions 2.13, 2.18, 2.19, 2.21 and 2.24. Then, for
any Lipschitz-continuous test function $h : \mathbb{R} \to \mathbb{R}$ the following bounds hold true:

(a) For each $x \in (-\infty, a)$ we have $|g_h(x)| \le \|h'\|_\infty\,\dfrac{E[Z]\,Q_l(x) - \int_a^x s\,q_l(s)\,ds}{I_l(x)}$.

(b) For each $x \in (b, \infty)$ we have $|g_h(x)| \le \|h'\|_\infty\,\dfrac{E[Z]\,Q_r(x) - \int_b^x s\,q_r(s)\,ds}{I_r(x)}$.

(c) For each $x \in (-\infty, a)$ we have

$$|g_h'(x)| \le \frac{\|h'\|_\infty}{-\eta(x)\,I_l(x)} \left( \gamma(x)\Big(-x\,Q_l(x) + \int_a^x t\,q_l(t)\,dt\Big) + \big(E[Z] - x\big)\big(Q_l(x)\,\gamma(x) - I_l(x)\big) \right).$$

(d) For each $x \in (b, \infty)$ we have

$$|g_h'(x)| \le \frac{\|h'\|_\infty}{-\eta(x)\,I_r(x)} \left( \gamma(x)\Big(x\,Q_r(x) - \int_b^x t\,q_r(t)\,dt\Big) + \big(x - E[Z]\big)\big(\gamma(x)\,Q_r(x) - I_r(x)\big) \right).$$



Corollary 2.28. Assume Conditions 2.13, 2.18, 2.19, 2.21 and 2.24 and that
$\gamma(x) = c\,(E[Z] - x)$ for some $c > 0$. Then, for any Lipschitz-continuous test function
$h : \mathbb{R} \to \mathbb{R}$ one has the following bounds:

(a) $\|g_h\|_\infty \le \dfrac{\|h'\|_\infty}{c}$.

(b) For each $x \in (-\infty, a)$ we have

$$|g_h'(x)| \le 2\|h'\|_\infty\,\frac{\gamma(x)\Big(-x\,Q_l(x) + \int_a^x t\,q_l(t)\,dt\Big)}{-\eta(x)\,I_l(x)}.$$

(c) For each $x \in (b, \infty)$ we have

$$|g_h'(x)| \le 2\|h'\|_\infty\,\frac{\gamma(x)\Big(x\,Q_r(x) - \int_b^x t\,q_r(t)\,dt\Big)}{-\eta(x)\,I_r(x)}.$$

Again, the proof is in the appendix.
Next, we will give a short account of the properties of the solutions $g_z$ corresponding
to the test functions $h_z := 1_{(-\infty, z]}$, $z \in \mathbb{R}$, yielding the Kolmogorov distance. In
general, these functions are only of interest for $z \in (a,b)$ since for any real-valued
random variable $W$ and any $z \le a$ we have

$$P(W \le z) - P(Z \le z) = P(W \le z)$$

and for any $z \ge b$ we have

$$|P(W \le z) - P(Z \le z)| = 1 - P(W \le z) = P(W > z).$$

These probabilities have nothing to do with the distribution $\mu$ of $Z$ and hence one
should be able to bound them directly for a given $W$. Thus, we will focus on
$z \in (a,b)$.

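Proposition 2.29. Let $z \in (a,b)$ be given. Then, under the Conditions 2.1, 2.18,
2.19 and 2.21 we have:

$$g_z(x) = \begin{cases}
\dfrac{(1 - F(z))\,Q_l(x)}{I_l(x)}, & x < a \\[2mm]
\dfrac{1 - F(z)}{\gamma(a)}, & x = a \\[2mm]
\dfrac{F(x)\,(1 - F(z))}{I(x)}, & a < x \le z \\[2mm]
\dfrac{F(z)\,(1 - F(x))}{I(x)}, & z < x < b \\[2mm]
\dfrac{-F(z)}{\gamma(b)}, & x = b \\[2mm]
\dfrac{-F(z)\,Q_r(x)}{I_r(x)}, & x > b
\end{cases}$$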

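Furthermore, one has the bounds $\|g_z\|_\infty = \frac{F(z)(1 - F(z))}{I(z)}$ and thus
$\sup_{z \in (a,b)} \|g_z\|_\infty < \infty$. The functions $g_z$ are absolutely continuous on every compact
sub-interval of $\mathbb{R}$ and

$$g_z'(x) = \begin{cases}
\dfrac{-(1 - F(z))\,q_l(x)\big(\gamma(x)\,Q_l(x) - I_l(x)\big)}{I_l(x)^2}, & x < a \\[2mm]
\dfrac{(1 - F(z))\,p(x)\big(I(x) - \gamma(x)\,F(x)\big)}{I(x)^2}, & a < x < z \\[2mm]
\dfrac{-F(z)\,p(x)\big(I(x) + \gamma(x)(1 - F(x))\big)}{I(x)^2}, & z < x < b \\[2mm]
\dfrac{-F(z)\,q_r(x)\big(I_r(x) - \gamma(x)\,Q_r(x)\big)}{I_r(x)^2}, & x > b
\end{cases}$$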

Remark 2.30. (i) It is clear that a similar discussion of the solutions $g_z$ is possible
if $a = -\infty$ or $b = \infty$.
(ii) Note that we can write $g_z'(x) = \frac{(1 - F(z))\,p(x)\,H(x)}{I(x)^2}$ for $x \in (a, z)$ and
$g_z'(x) = \frac{-F(z)\,p(x)\,G(x)}{I(x)^2}$ for $x \in (z, b)$, with the functions $H$ and $G$ from Proposition
2.14. For concrete distributions one may often prove that $g_z'$ is increasing on
$(a, z)$ and decreasing on $(z, b)$, but this seems to be hard to prove in generality,
if it is true at all.

Finally, in our general setting, we will prove suitable "plug-in theorems" for
exchangeable pairs satisfying our general regression property (9). As was observed
in [Rol08] for the normal distribution, in the case of univariate distributional
approximations one does not need the full strength of exchangeability; equality in
distribution of the random variables $W$ and $W'$ is sufficient. This may allow for a
greater choice of admissible couplings in several situations, or at least simplifies the
verification of the required properties.
In the following, let $(\Omega, \mathcal{A}, P)$ be a probability space and let $W, W'$ be real-valued
random variables defined on this space such that $W \stackrel{\mathcal{D}}{=} W'$. Let, as before, $\mu$ be
our target distribution with support $\overline{(a,b)}$ fulfilling Condition 2.1. From now on we
will assume that the random variables $W$ and $W'$ only have values in an interval
$I \supseteq (a,b)$ where both functions $\eta$ and $\gamma$ are defined (recall that it might be the case
that $\eta$ can only be defined on $(a,b)$).

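Proposition 2.31. Assume that $\gamma$ satisfies Condition 2.2 and that $W$ is square
integrable with $E[|\gamma(W)|] < \infty$. Furthermore, let the pair $(W, W')$ satisfy the general
regression property (9). Let $f : I \to \mathbb{R}$ be an absolutely continuous function with
$\|f\|_\infty, \|f'\|_\infty < \infty$, where $f'$ is a given, Borel-measurable version of the derivative
of $f$. Then we have

$$\begin{aligned}
\big| E[\eta(W) f'(W) + \gamma(W) f(W)] \big|
&\le \|f'\|_\infty\, E\left| \eta(W) - \frac{1}{2\lambda} E[(W' - W)^2 \mid W] \right| \\
&\quad + \frac{1}{\lambda}\, E\left[ (W' - W)^2 \int_0^1 (1 - s)\,\big| f'(W) - f'(W + s(W' - W)) \big|\,ds \right] \\
&\quad + \frac{\|f\|_\infty}{\lambda}\, E[|R|].
\end{aligned} \tag{29}$$

If $f'$ is also absolutely continuous and $\|f''\|_\infty < \infty$ for some Borel-measurable
version $f''$ of the second derivative, then we also have the bound

$$\begin{aligned}
\big| E[\eta(W) f'(W) + \gamma(W) f(W)] \big|
&\le \|f'\|_\infty\, E\left| \eta(W) - \frac{1}{2\lambda} E[(W' - W)^2 \mid W] \right| \\
&\quad + \frac{\|f''\|_\infty}{6\lambda}\, E\big[ |W' - W|^3 \big] + \frac{\|f\|_\infty}{\lambda}\, E[|R|].
\end{aligned} \tag{30}$$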



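Proof. Let $x_0$ be as in Condition 2.2 and define the function $G : I \to \mathbb{R}$ by $G(x) :=
\int_{x_0}^x f(y)\,dy$. Then, by a suitable version of Taylor's formula, for each $x, x' \in I$ we
have

$$\begin{aligned}
G(x') - G(x) &= G'(x)(x' - x) + \int_x^{x'} (x' - t)\,G''(t)\,dt \\
&= f(x)(x' - x) + \int_x^{x'} (x' - t)\,f'(t)\,dt \\
&= f(x)(x' - x) + (x' - x)^2 \int_0^1 (1 - s)\,f'(x + s(x' - x))\,ds.
\end{aligned}$$

Hence, by distributional equality, we obtain

$$\begin{aligned}
0 &= E[G(W')] - E[G(W)] \\
&= E[f(W)(W' - W)] + E\left[ (W' - W)^2 \int_0^1 (1 - s)\,f'(W + s(W' - W))\,ds \right] \\
&= E\big[ f(W)\,E[W' - W \mid W] \big] + E\left[ (W' - W)^2 \int_0^1 (1 - s)\,f'(W + s(W' - W))\,ds \right] \\
&= \lambda E[f(W)\gamma(W)] + E[f(W) R] + E\left[ (W' - W)^2 \int_0^1 (1 - s)\,f'(W + s(W' - W))\,ds \right],
\end{aligned}$$

yielding

$$E[f(W)\gamma(W)] = -\frac{1}{\lambda} E\left[ (W' - W)^2 \int_0^1 (1 - s)\,f'(W + s(W' - W))\,ds \right] - \frac{1}{\lambda} E[f(W) R]. \tag{31}$$

This immediately implies the identity

$$\begin{aligned}
E[\eta(W) f'(W) + \gamma(W) f(W)]
&= E\left[ f'(W)\Big( \eta(W) - \frac{1}{2\lambda}(W' - W)^2 \Big) \right] \\
&\quad + \frac{1}{\lambda} E\left[ (W' - W)^2 \int_0^1 (1 - s)\big( f'(W) - f'(W + s(W' - W)) \big)\,ds \right] \\
&\quad - \frac{1}{\lambda} E[f(W) R].
\end{aligned} \tag{32}$$

Here we have used that $\int_0^1 (1 - s)\,ds = \frac{1}{2}$. From (32) and the assumptions on $f$ the bound (29) now easily follows. To prove
(30) it suffices to observe that

$$\big| f'(W) - f'(W + s(W' - W)) \big| \le \|f''\|_\infty\, s\,|W' - W| \quad\text{and}\quad \int_0^1 s(1 - s)\,ds = \frac{1}{6}.$$

□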



Remark 2.32. (i) From the first term on the right hand side of (29) we see that
the bound can only be useful if $E[(W' - W)^2 \mid W] \approx 2\lambda\,\eta(W)$. Similarly, the
third term reveals that, indeed, $R$ should be of smaller order than $\lambda$.
(ii) The proof shows that Proposition 2.31 can easily be generalized to the situation
where there is a $\sigma$-algebra $\mathcal{F}$ with $\sigma(W) \subseteq \mathcal{F} \subseteq \mathcal{A}$ and the more general
regression property

$$E[W' - W \mid \mathcal{F}] = \lambda\gamma(W) + R \tag{33}$$

with some $\mathcal{F}$-measurable remainder term $R$ is satisfied.
(iii) If $\mathcal{H}$ is some class of test functions such that there are finite, positive constants
$c_0$, $c_1$ and $c_2$ with $\|g_h\|_\infty \le c_0$, $\|g_h'\|_\infty \le c_1$ and $\|g_h''\|_\infty \le c_2$ for each $h \in \mathcal{H}$,
then (30) immediately yields a bound on the distance

$$d_{\mathcal{H}}(\mu, \mathcal{L}(W)) = \sup_{h \in \mathcal{H}} \big| E[h(W)] - E[h(Z)] \big|.$$

3. Stein's method for Beta distributions

In this section we will specialize the abstract theory from Section 2 to the family
of Beta distributions. This is not the first time that Stein's method for Beta
distributions is considered. In [Sch01] the Stein equation is found, but no bounds on the
solutions are given. Specialized to the Beta distributions, the general results from
the papers [KT12] and [EV12] yield useful bounds on the Stein solutions and their
first derivatives. Our bounds are qualitatively comparable to those from [EV12].
The class of Beta distributions is rather large, including as special cases the arcsine
distribution, the semicircle law and the uniform distribution over an interval.
Usually, the family of Beta distributions is defined as a family of distributions on
$[0,1]$ depending on two parameters $a, b > 0$. Recall that for $a, b > 0$ the Beta
function $B(a,b) := \int_0^1 x^{a-1}(1-x)^{b-1}\,dx$ is well-defined and hence the function
$x \mapsto \frac{1}{B(a,b)}\,x^{a-1}(1-x)^{b-1}\,1_{(0,1)}(x)$ is a probability density function. The
corresponding distribution, denoted by $\nu_{a,b}$, is called the Beta distribution on $[0,1]$ corresponding to
the parameters $a$ and $b$.

However, in some cases, including the semicircular law, it is more convenient to
consider the corresponding distribution on $[-1,1]$. Using the substitution rule, it is a
matter of routine to check that for $\alpha, \beta > -1$ the function $p_{\alpha,\beta}(x) := C(\alpha,\beta)(1-x)^\alpha(1+x)^\beta\,1_{(-1,1)}(x)$
is also a probability density function, where

$$C(\alpha,\beta) := \frac{\Gamma(\alpha+\beta+2)}{2^{\alpha+\beta+1}\,\Gamma(\alpha+1)\,\Gamma(\beta+1)}.$$

Here, we have used the well-known relation $B(a,b) = \frac{\Gamma(a)\Gamma(b)}{\Gamma(a+b)}$
between the Beta function and Euler's Gamma function. The probability distribution
corresponding to the density $p_{\alpha,\beta}$ will be denoted by $\mu_{\alpha,\beta}$ and is called the Beta
distribution on $[-1,1]$ corresponding to the parameters $\alpha$ and $\beta$. We denote its
distribution function by $F_{\alpha,\beta}$. Thus, $F_{\alpha,\beta}(x) = \int_{-\infty}^x p_{\alpha,\beta}(t)\,dt$, $x \in \mathbb{R}$. One can easily
see that if $X \sim \nu_{a,b}$ and if we define $Y := 2X - 1$, then $Y \sim \mu_{b-1,a-1}$. Conversely, if
$Y \sim \mu_{\alpha,\beta}$ and if $X := \frac{Y+1}{2}$, then $X \sim \nu_{\beta+1,\alpha+1}$. As this transformation is merely a
matter of translation and scaling, it will only be necessary to develop the theory for
one of these intervals. For definiteness we choose the interval $[-1,1]$ but will later
on transfer our theory to the corresponding distributions on $[0,1]$.
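
As a numerical sanity check of the normalizing constant $C(\alpha,\beta)$ and of the mean $E[Z] = \frac{\beta-\alpha}{\alpha+\beta+2}$ used further below, the following sketch integrates the density on $[-1,1]$ with a plain midpoint rule. The parameter values, the helper names and the quadrature rule are illustrative assumptions and not part of the paper.

```python
import math


def beta_density_pm1(x, alpha, beta):
    """Density p_{alpha,beta} of the Beta distribution on [-1, 1]."""
    if not -1.0 < x < 1.0:
        return 0.0
    norm_const = math.gamma(alpha + beta + 2) / (
        2 ** (alpha + beta + 1) * math.gamma(alpha + 1) * math.gamma(beta + 1)
    )
    return norm_const * (1 - x) ** alpha * (1 + x) ** beta


def midpoint_integral(func, lower, upper, steps=100_000):
    """Composite midpoint rule; it only evaluates interior points."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (i + 0.5) * width) for i in range(steps))


# Illustrative parameters (assumption): alpha = 1.5, beta = 0.5.
alpha, beta = 1.5, 0.5
total_mass = midpoint_integral(lambda x: beta_density_pm1(x, alpha, beta), -1.0, 1.0)
mean = midpoint_integral(lambda x: x * beta_density_pm1(x, alpha, beta), -1.0, 1.0)
expected_mean = (beta - alpha) / (alpha + beta + 2)
```

Up to quadrature error, `total_mass` should be close to 1 and `mean` close to `expected_mean`.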

From now on, fix $\alpha, \beta > -1$. Now we introduce a Stein identity for the Beta
distribution $\mu_{\alpha,\beta}$. It is easily checked that its density $p_{\alpha,\beta}$ satisfies the ordinary
differential equation

$$0 = (x^2 - 1)\,p_{\alpha,\beta}'(x) - (\alpha+\beta)\,x\,p_{\alpha,\beta}(x) + (\beta - \alpha)\,p_{\alpha,\beta}(x) =: A p_{\alpha,\beta}(x). \tag{34}$$

Integrating by parts one obtains the conjugate operator $L := A^*$, which is defined
by the equation $\langle Af, g \rangle_{L^2} = \langle f, A^* g \rangle_{L^2}$, and which is known to serve as a
characterizing operator for the distribution $\mu_{\alpha,\beta}$. To be concrete, in our case we have

$$Lg(x) = (1 - x^2)\,g'(x) - (\alpha+\beta+2)\,x\,g(x) + (\beta - \alpha)\,g(x)$$

for smooth enough functions $g$, yielding the Stein identity

$$E\Big[ (1 - Z^2)\,g'(Z) - \big[ (\alpha+\beta+2) Z + (\alpha - \beta) \big]\,g(Z) \Big] = 0,$$

where $Z \sim \mu_{\alpha,\beta}$.
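
The Stein identity can be checked numerically for a concrete smooth function. The sketch below uses $g = \sin$ and midpoint quadrature; the choice of $g$, the parameter values and all function names are assumptions made only for illustration.

```python
import math

ALPHA, BETA = 1.5, 0.5  # illustrative parameters (assumption)


def density(x):
    """Density p_{alpha,beta} on (-1, 1)."""
    norm_const = math.gamma(ALPHA + BETA + 2) / (
        2 ** (ALPHA + BETA + 1) * math.gamma(ALPHA + 1) * math.gamma(BETA + 1)
    )
    return norm_const * (1 - x) ** ALPHA * (1 + x) ** BETA


def stein_operator(g, dg, x):
    """(1 - x^2) g'(x) - [(alpha + beta + 2) x + (alpha - beta)] g(x)."""
    return (1 - x * x) * dg(x) - ((ALPHA + BETA + 2) * x + (ALPHA - BETA)) * g(x)


def expectation(func, steps=100_000):
    """E[func(Z)] for Z ~ mu_{alpha,beta}, via the midpoint rule on (-1, 1)."""
    width = 2.0 / steps
    total = 0.0
    for i in range(steps):
        x = -1.0 + (i + 0.5) * width
        total += func(x) * density(x)
    return width * total


# g = sin, g' = cos: the expectation of the Stein operator should vanish.
stein_expectation = expectation(lambda x: stein_operator(math.sin, math.cos, x))
```

The value `stein_expectation` should vanish up to quadrature error, in line with the identity above.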

Next we want to show that the operator $L$ characterizes the distribution $\mu_{\alpha,\beta}$. To
do so, we introduce the function $q_{\alpha,\beta}(x) := C(\alpha,\beta)(1-x)^{\alpha+1}(1+x)^{\beta+1}\,1_{(-1,1)}(x) =
(1 - x^2)\,p_{\alpha,\beta}(x)$ and the class $\mathcal{K}_{\alpha,\beta}$ consisting of all continuous and piecewise
continuously differentiable functions $g : \mathbb{R} \to \mathbb{R}$ vanishing at infinity with
$\int_{\mathbb{R}} |g'(x)|\,q_{\alpha,\beta}(x)\,dx < \infty$. Our observations lead to the following proposition.

Proposition 3.1 (Stein characterization for $\mu_{\alpha,\beta}$). A real-valued random variable $X$
is distributed according to $\mu_{\alpha,\beta}$ if and only if for all functions $g \in \mathcal{K}_{\alpha,\beta}$ the expected
values $E[(1 - X^2)\,g'(X)]$ and $E[(\alpha+\beta+2)\,X g(X) + (\alpha - \beta)\,g(X)]$ exist and coincide.
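Proof. First, let $\mathcal{L}(X) = \mu_{\alpha,\beta}$ and let $g \in \mathcal{K}_{\alpha,\beta}$. By the hypothesis and the
transformation formula, we have

$$\begin{aligned}
\infty &> \int_{\mathbb{R}} |g'(x)|\,q_{\alpha,\beta}(x)\,dx = C(\alpha,\beta) \int_{-1}^1 |g'(x)|\,(1-x)^{\alpha+1}(1+x)^{\beta+1}\,dx \\
&= C(\alpha,\beta) \int_{-1}^1 (1 - x^2)\,|g'(x)|\,(1-x)^\alpha (1+x)^\beta\,dx = \int_{-1}^1 |(1 - x^2)\,g'(x)|\,p_{\alpha,\beta}(x)\,dx \\
&= E\big[ |(1 - X^2)\,g'(X)| \big].
\end{aligned}$$

Hence the expected value $E[(1 - X^2)\,g'(X)]$ exists. Since $g$ is continuous, it is bounded
on $[-1,1]$ and so the expected value $E[(\alpha+\beta+2)\,X g(X) + (\alpha - \beta)\,g(X)]$ exists, too.
Again, by the transformation rule and since $g$ and $q_{\alpha,\beta}$ are absolutely continuous on
$[-1,1]$ we can use integration by parts and have

$$\begin{aligned}
E[(1 - X^2)\,g'(X)] &= \int_{-1}^1 (1 - x^2)\,g'(x)\,p_{\alpha,\beta}(x)\,dx = \int_{-1}^1 g'(x)\,q_{\alpha,\beta}(x)\,dx \\
&= 0 - C(\alpha,\beta) \int_{-1}^1 g(x) \big[ (\beta+1)(1+x)^\beta (1-x)^{\alpha+1} - (\alpha+1)(1-x)^\alpha (1+x)^{\beta+1} \big]\,dx \\
&= -\int_{-1}^1 g(x) \big[ \beta - \alpha - x(\beta+\alpha+2) \big]\,C(\alpha,\beta)(1-x)^\alpha (1+x)^\beta\,dx \\
&= \int_{-1}^1 g(x) \big[ \alpha - \beta + x(\alpha+\beta+2) \big]\,p_{\alpha,\beta}(x)\,dx \\
&= E[(\alpha+\beta+2)\,X g(X) + (\alpha - \beta)\,g(X)].
\end{aligned}$$

For the converse fix an arbitrary $z \in \mathbb{R} \setminus \{-1, +1\}$ and consider the solution $g_z$ to
Stein's equation

$$(1 - x^2)\,g'(x) - (\alpha+\beta+2)\,x\,g(x) + (\beta - \alpha)\,g(x) = h_z(x) - \mu_{\alpha,\beta}((-\infty, z]), \tag{35}$$

where $h_z := 1_{(-\infty, z]}$. It will be shown in Proposition 3.2 below that $g_z \in \mathcal{K}_{\alpha,\beta}$, so
that by hypothesis we have

$$\begin{aligned}
0 &= E\big[ (1 - X^2)\,g_z'(X) - (\alpha+\beta+2)\,X g_z(X) + (\beta - \alpha)\,g_z(X) \big] \\
&= E\big[ h_z(X) - \mu_{\alpha,\beta}((-\infty, z]) \big] \\
&= P(X \le z) - \mu_{\alpha,\beta}((-\infty, z]).
\end{aligned}$$

Since $z \in \mathbb{R} \setminus \{-1, +1\}$ was arbitrary and by continuity from the right of
distribution functions, the proof is complete. □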

Since we have fixed the parameters $\alpha$ and $\beta$, henceforth we may and will suppress
them as sub-indices at objects which might well depend on them (for example we
will simply write $p$ for $p_{\alpha,\beta}$ and so on). As we would like to use the theory from
Section 2, we have to make sure that our Stein identity for the Beta distribution fits
into this framework, i.e. that relation (15) is satisfied with $\eta(x) = 1 - x^2$ and

$$\gamma(x) = -(\alpha+\beta+2)\,x + (\beta - \alpha) = -\big[ (\alpha+\beta+2)\,x + (\alpha - \beta) \big] = (\alpha+\beta+2)\big( E[Z] - x \big),$$

where we have used that $E[Z] = \frac{\beta - \alpha}{\alpha+\beta+2}$. In principle, this is clear, because we
have just established a Stein characterization for $\mu$ and, given the density $p$ and the
function $\gamma$, the corresponding $\eta$ is, of course, unique. However, we give a formal
proof.

According to (15) we must show that

$$(1 - x^2)\,p(x) = \int_{-1}^x \big[ -(\alpha+\beta+2)\,t + (\beta - \alpha) \big]\,p(t)\,dt \tag{36}$$

holds for all $x \in (-1, 1)$. First note that

$$\psi(x) := \frac{p'(x)}{p(x)} = \frac{-(\alpha+\beta)\,x + \beta - \alpha}{1 - x^2}.$$

Differentiating the left hand side of (36), we obtain

$$\frac{d}{dx}\,(1 - x^2)\,p(x) = -2x\,p(x) + (1 - x^2)\,p'(x) = p(x)\big( -2x + (1 - x^2)\,\psi(x) \big) = p(x)\big[ -(\alpha+\beta+2)\,x + \beta - \alpha \big],$$

which is of course the derivative of the right hand side, too. Since

$$\lim_{x \searrow -1} (1 - x^2)\,p(x) = 0 = \lim_{x \searrow -1} \int_{-1}^x \big[ -(\alpha+\beta+2)\,t + (\beta - \alpha) \big]\,p(t)\,dt,$$

relation (36) is proved. Note that we may extend the functions $\gamma$ and $\eta$ to functions
on $\mathbb{R}$ by just the same "analytical" expressions as above. Next we will show that all
the conditions from Section 2 are satisfied in the special case of Beta distributions.
It is easy to see that Conditions 2.1, 2.2, 2.13 and 2.18 are satisfied by the functions
$\gamma$, $\eta$ and $p$.
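
Relation (36) can also be checked numerically at a single point. The following sketch compares both sides at $x = 0.3$, using midpoint quadrature for the integral; the evaluation point, the parameter values and the function names are illustrative assumptions.

```python
import math

ALPHA, BETA = 1.5, 0.5  # illustrative parameters (assumption)


def density(x):
    """Density p of mu_{alpha,beta} on (-1, 1)."""
    norm_const = math.gamma(ALPHA + BETA + 2) / (
        2 ** (ALPHA + BETA + 1) * math.gamma(ALPHA + 1) * math.gamma(BETA + 1)
    )
    return norm_const * (1 - x) ** ALPHA * (1 + x) ** BETA


def gamma_fn(x):
    """gamma(x) = -(alpha + beta + 2) x + (beta - alpha)."""
    return -(ALPHA + BETA + 2) * x + (BETA - ALPHA)


def integral_gamma_p(x, steps=100_000):
    """Midpoint approximation of the right hand side of (36)."""
    width = (x + 1.0) / steps
    total = 0.0
    for i in range(steps):
        t = -1.0 + (i + 0.5) * width
        total += gamma_fn(t) * density(t)
    return width * total


x0 = 0.3
left_side = (1 - x0 * x0) * density(x0)   # eta(x) p(x)
right_side = integral_gamma_p(x0)         # integral of gamma * p from -1 to x
```

Both values should agree up to quadrature error, which is exactly the statement $\eta(x)p(x) = I(x)$ at $x = 0.3$.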

Condition 2.6 is also satisfied but need not be proved, because its most important
conclusion, namely that $\eta(1) = \eta(-1) = 0$, is clear from the above discussion. To
verify Conditions 2.19, 2.21 and 2.24 we must first define the functions $F_l$ on $(-\infty, -1)$
and $F_r$ on $(1, \infty)$. We claim that the functions

$$F_l(x) := (\alpha+1)\log(1 - x) + (\beta+1)\log(-1 - x), \quad x < -1,$$

and

$$F_r(x) := (\alpha+1)\log(x - 1) + (\beta+1)\log(1 + x), \quad x > 1,$$

do the job. In fact,

$$F_l'(x) = \frac{\alpha+1}{1 - x}\,(-1) + \frac{\beta+1}{-1 - x}\,(-1) = \frac{-(\alpha+\beta+2)\,x + \beta - \alpha}{1 - x^2} = \frac{\gamma(x)}{\eta(x)}$$

for each $x < -1$ and similarly $F_r'(x) = \frac{\gamma(x)}{\eta(x)}$ for $x > 1$. From these two functions
we obtain the functions $q_l : (-\infty, -1) \to \mathbb{R}$ and $q_r : (1, \infty) \to \mathbb{R}$ defined by

$$q_l(x) := \frac{\exp(F_l(x))}{\eta(x)} = \frac{(1 - x)^{\alpha+1}(-1 - x)^{\beta+1}}{1 - x^2} = -(1 - x)^\alpha (-1 - x)^\beta$$

and

$$q_r(x) := \frac{\exp(F_r(x))}{\eta(x)} = \frac{(x - 1)^{\alpha+1}(1 + x)^{\beta+1}}{1 - x^2} = -(x - 1)^\alpha (1 + x)^\beta.$$

These two functions are of course locally integrable and hence Condition 2.19 is
satisfied. Since

$$\lim_{x \nearrow -1} F_l(x) = -\infty = \lim_{x \searrow 1} F_r(x)$$

and

$$\lim_{x \to -\infty} F_l(x) = +\infty = \lim_{x \to \infty} F_r(x),$$

Conditions 2.21 and 2.24 also hold. Consequently, all results from Section 2 are
valid in particular for the case of Beta distributions.
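
The claim that $F_l$ and $F_r$ are primitives of $\gamma/\eta$ outside $[-1,1]$ can be illustrated with a finite-difference check. In the sketch below the evaluation points $x = -2$ and $x = 3$, the step size and the function names are assumptions chosen only for illustration.

```python
import math

ALPHA, BETA = 1.5, 0.5  # illustrative parameters (assumption)


def gamma_fn(x):
    return -(ALPHA + BETA + 2) * x + (BETA - ALPHA)


def eta_fn(x):
    return 1 - x * x


def F_left(x):
    """Primitive F_l on (-inf, -1)."""
    return (ALPHA + 1) * math.log(1 - x) + (BETA + 1) * math.log(-1 - x)


def F_right(x):
    """Primitive F_r on (1, inf)."""
    return (ALPHA + 1) * math.log(x - 1) + (BETA + 1) * math.log(1 + x)


def central_difference(func, x, step=1e-6):
    """Symmetric difference quotient approximating func'(x)."""
    return (func(x + step) - func(x - step)) / (2 * step)


left_gap = abs(central_difference(F_left, -2.0) - gamma_fn(-2.0) / eta_fn(-2.0))
right_gap = abs(central_difference(F_right, 3.0) - gamma_fn(3.0) / eta_fn(3.0))

# q_l = exp(F_l) / eta is negative left of -1 because eta < 0 there.
q_left_at_minus_two = math.exp(F_left(-2.0)) / eta_fn(-2.0)
```

Both gaps should be of the order of the finite-difference error, and `q_left_at_minus_two` is negative, in agreement with Proposition 2.23 (a).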

Now, let $h : \mathbb{R} \to \mathbb{R}$ be a given Borel-measurable test function with
$\int_{\mathbb{R}} |h(x)|\,p(x)\,dx < \infty$, $\int_x^{-1} |\tilde h(t)|\,(-q_l(t))\,dt < \infty$ for each $x < -1$ and
$\int_1^x |\tilde h(t)|\,(-q_r(t))\,dt < \infty$ for each $x > 1$, where $\tilde h = h - \mu(h)$. Consider the
corresponding Stein equation

$$(1 - x^2)\,g'(x) - (\alpha+\beta+2)\,x\,g(x) + (\beta - \alpha)\,g(x) = h(x) - \mu_{\alpha,\beta}(h) =: \tilde h(x). \tag{37}$$

Then, according to the theory from Section 2, the Stein solution corresponding to
(37) is given by the function $g_h : \mathbb{R} \to \mathbb{R}$ with

$$g_h(x) = \begin{cases}
\exp(-F_l(x)) \displaystyle\int_{-1}^x \tilde h(t)\,q_l(t)\,dt, & x < -1 \\[2mm]
\dfrac{h(-1) - \mu(h)}{2\beta+2}, & x = -1 \\[2mm]
\dfrac{1}{(1 - x^2)\,p(x)} \displaystyle\int_{-1}^x \tilde h(t)\,p(t)\,dt, & -1 < x < 1 \\[2mm]
-\dfrac{h(1) - \mu(h)}{2\alpha+2}, & x = 1 \\[2mm]
\exp(-F_r(x)) \displaystyle\int_1^x \tilde h(t)\,q_r(t)\,dt, & x > 1
\end{cases} \tag{38}$$

Here, the values of $g_h$ at $\pm 1$ are arbitrary, but they are chosen such that $g_h$ is
continuous whenever $h$ is continuous at $\pm 1$. This follows immediately from Proposition
2.22.
The next result, which is also proved in the appendix, completes the proof of
Proposition 3.1 by showing that the solution $g_z$ is in the class $\mathcal{K}_{\alpha,\beta}$ whenever $z \neq \pm 1$.

Proposition 3.2. For each $z \in \mathbb{R} \setminus \{-1, +1\}$ the solution $g_z$ belongs to the class
$\mathcal{K}_{\alpha,\beta}$, which was defined above before Proposition 3.1.

Next, we will derive some results for the solutions $g_h$ from corresponding results
in Section 2.

Proposition 3.3. Let $h : \mathbb{R} \to \mathbb{R}$ be bounded and Borel-measurable and let $m$ be a
median for $\mu$. Then we have the bound

$$\|g_h\|_\infty \le \|h - \mu(h)\|_\infty \max\left( \frac{1}{2(1 - m^2)\,p(m)},\ \frac{1}{2\beta+2},\ \frac{1}{2\alpha+2} \right).$$

Proof. Since $\gamma(-1) = 2\beta+2$ and $\gamma(1) = -2\alpha-2$, this immediately follows from
Proposition 2.25. □

Proposition 3.4. Let $h : \mathbb{R} \to \mathbb{R}$ be Lipschitz-continuous. Then we have the
following bounds:

(a) $\|g_h\|_\infty \le \dfrac{\|h'\|_\infty}{\alpha+\beta+2}$.

(b) There exists a constant $K_1$, only depending on $\alpha$ and $\beta$, such that
$\|g_h'\|_\infty \le K_1 \|h'\|_\infty$.

The proof is in the appendix.
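
A short worked example, not taken from the paper, may illustrate that bound (a) cannot be improved in general. It takes the identity $h(x) = x$ as test function and uses only the definitions of $\gamma$, $\eta$ and equation (37).

```latex
% Worked example (illustrative assumption): test function h(x) = x.
% Then \mu(h) = E[Z], and since \gamma(x) = (\alpha+\beta+2)(E[Z] - x),
\tilde h(x) = x - E[Z] = -\frac{\gamma(x)}{\alpha+\beta+2}.
% The constant function g \equiv -1/(\alpha+\beta+2) has g' = 0, hence
\eta(x)\,g'(x) + \gamma(x)\,g(x) = -\frac{\gamma(x)}{\alpha+\beta+2} = \tilde h(x),
% so g solves (37). By (36), the inner formula in (38) gives the same function:
\frac{1}{(1-x^2)\,p(x)} \int_{-1}^{x} \tilde h(t)\,p(t)\,dt
  = -\frac{1}{\alpha+\beta+2} \cdot \frac{\int_{-1}^{x} \gamma(t)\,p(t)\,dt}{(1-x^2)\,p(x)}
  = -\frac{1}{\alpha+\beta+2}.
% Consequently
\|g_h\|_\infty = \frac{1}{\alpha+\beta+2} = \frac{\|h'\|_\infty}{\alpha+\beta+2}.
```

For this particular $h$ the inequality in (a) therefore holds with equality.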

Now, consider a twice differentiable function $h : \mathbb{R} \to \mathbb{R}$ with bounded first and
second derivative. Recall the discussion following Remark 2.17. It easily follows from
(23) that, in the case of Beta distributions, we have $\tilde\mu_{\alpha,\beta} = \mu_{\alpha+1,\beta+1}$ and we have
to show that the function $g := g_h'$ satisfies the Stein identity for $\mu_{\alpha+1,\beta+1}$, i.e. that for
$Y \sim \mu_{\alpha+1,\beta+1}$ we have

$$E\Big[ (1 - Y^2)\,g'(Y) + \big[ -(\alpha+\beta+4)\,Y + \beta - \alpha \big]\,g(Y) \Big] = 0. \tag{39}$$

The following lemma, which is proved in the appendix, will be useful.

Lemma 3.5. For the Beta distribution $\mu_{\alpha,\beta}$ and a given bounded, Borel-measurable
function $u : \mathbb{R} \to \mathbb{R}$ the Stein equation

$$\eta(x)\,f'(x) + \gamma(x)\,f(x) = u(x) \tag{40}$$

has a bounded solution $f$ on $(-1, 1)$ if and only if $E[u(Z)] = 0$.

Now, from Proposition 3.4 (b) we know that $g$ is bounded and from (22) we know
that $g$ satisfies the Stein equation corresponding to $\tilde\mu$ for the test function
$h_2(x) = h'(x) - \gamma'(x)\,g_h(x)$. By Lemma 3.5 we thus have that
$\int_{\mathbb{R}} h_2(x)\,d\mu_{\alpha+1,\beta+1}(x) = 0$. Hence, $g$ must be the only bounded solution to the Stein
equation for $\mu_{\alpha+1,\beta+1}$ corresponding to the test function $h_2$. Since

$$h_2(x) = h'(x) - \gamma'(x)\,g_h(x) = h'(x) + (\alpha+\beta+2)\,g_h(x)$$

we have

$$h_2'(x) = h''(x) + (\alpha+\beta+2)\,g_h'(x)$$

and from Proposition 3.4 we see that $h_2$ is Lipschitz with minimal Lipschitz
constant

$$\|h_2'\|_\infty \le \|h''\|_\infty + (\alpha+\beta+2)\,K_1(\alpha,\beta)\,\|h'\|_\infty,$$

where the constant $K_1(\alpha,\beta)$ from Proposition 3.4 only depends on $\alpha$ and $\beta$.
Applying Proposition 3.4 again, this time to the distribution $\mu_{\alpha+1,\beta+1}$ and the Stein
solution $g$, we obtain

$$\|g_h''\|_\infty = \|g'\|_\infty \le K_1(\alpha+1,\beta+1)\,\|h_2'\|_\infty \le K_1(\alpha+1,\beta+1)\big( \|h''\|_\infty + (\alpha+\beta+2)\,K_1(\alpha,\beta)\,\|h'\|_\infty \big).$$

Hence, there is a constant $K_2$ depending only on $\alpha$ and $\beta$ such that

$$\|g_h''\|_\infty \le K_2\big( \|h'\|_\infty + \|h''\|_\infty \big)$$

for all twice differentiable functions $h$ with bounded first and second derivative.
We have thus proved the following proposition.

Proposition 3.6. There exists a finite constant $K_2$ depending only on $\alpha$ and $\beta$ such
that for each twice differentiable function $h : \mathbb{R} \to \mathbb{R}$ with bounded first and second
derivative we have the bound

$$\|g_h''\|_\infty \le K_2\big( \|h'\|_\infty + \|h''\|_\infty \big).$$

Now we are in the position to provide a "plug-in theorem" for the Beta
approximation using exchangeable pairs.

Theorem 3.7. Let $W, W'$ be identically distributed, real-valued random variables on
a common probability space $(\Omega, \mathcal{A}, P)$ satisfying the regression property

$$E[W' - W \mid W] = \lambda\gamma(W) + R$$

for some constant $\lambda > 0$ and a random variable $R$. Then for each twice
differentiable function $h$ with bounded first and second derivative and with $E[|h(W)|] < \infty$
we have the bound

$$\begin{aligned}
\big| E[h(W)] - \mu_{\alpha,\beta}(h) \big|
&\le K_1 \|h'\|_\infty\, E\left| \eta(W) - \frac{1}{2\lambda} E[(W' - W)^2 \mid W] \right| \\
&\quad + \frac{K_2\big( \|h'\|_\infty + \|h''\|_\infty \big)}{6\lambda}\, E\big[ |W' - W|^3 \big] + \frac{\|h'\|_\infty}{(\alpha+\beta+2)\,\lambda}\, E[|R|],
\end{aligned}$$

where $\eta(x) = 1 - x^2$ and the constants $K_1$ and $K_2$ are from Propositions 3.4 and 3.6, respectively.

Proof. This immediately follows from Propositions 2.31, 3.4 and 3.6, since $g_h$ is a
solution to Stein's equation (37). □



In the following we will transfer the developed theory to the Beta distributions
$\nu_{a,b}$ on $[0,1]$. We start with the Stein identity for $\nu_{a,b}$, where $a, b > 0$ are fixed
parameters. Let $X \sim \nu_{a,b}$; then $Y := 2X - 1 \sim \mu_{b-1,a-1}$ and hence for each smooth
enough function $f$ we have

$$0 = E\big[ (1 - Y^2)\,f'(Y) - (a+b)\,Y f(Y) + (a - b)\,f(Y) \big].$$

Let $\tilde f(x) := f(2x - 1)$. Then $\tilde f'(x) = 2 f'(2x - 1)$ and $\tilde f(X) = f(Y)$. Hence, we
obtain

$$\begin{aligned}
0 &= E\big[ (1 - Y^2)\,f'(Y) - (a+b)\,Y f(Y) + (a - b)\,f(Y) \big] \\
&= E\big[ 4X(1 - X)\,f'(2X - 1) - (a+b)(2X - 1)\,f(2X - 1) + (a - b)\,f(2X - 1) \big] \\
&= E\big[ 2X(1 - X)\,\tilde f'(X) - 2(a+b)\,X \tilde f(X) + 2a\,\tilde f(X) \big] \\
&= 2 E\big[ X(1 - X)\,\tilde f'(X) - (a+b)\,X \tilde f(X) + a\,\tilde f(X) \big].
\end{aligned}$$

So, a Stein identity for $X \sim \nu_{a,b}$ is given by

$$E\Big[ X(1 - X)\,f'(X) + \big[ -(a+b)\,X + a \big]\,f(X) \Big] = 0$$

for all smooth enough functions $f : \mathbb{R} \to \mathbb{R}$. Hence, for $\nu_{a,b}$ the functions $\eta$ and
$\gamma$ are given by $\eta(x) = x(1 - x)$ and $\gamma(x) = -(a+b)\,x + a = (a+b)\big( \frac{a}{a+b} - x \big)$.
Note that $E[X] = \frac{a}{a+b}$ for $X \sim \nu_{a,b}$. Having derived the Stein identity for $\nu_{a,b}$, the
Stein equation corresponding to a Borel-measurable test function $h : \mathbb{R} \to \mathbb{R}$ with
$\int_{\mathbb{R}} |h(x)|\,d\nu_{a,b}(x) < \infty$ is given by

$$x(1 - x)\,f'(x) + \big[ -(a+b)\,x + a \big]\,f(x) = h(x) - \nu_{a,b}(h) =: \tilde h(x). \tag{41}$$
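
The Stein identity on $[0,1]$ behind equation (41) can again be checked by quadrature. The sketch below uses the test function $f = \exp$ with parameters $a = 2.5$, $b = 1.5$; these choices and all helper names are assumptions made for illustration only.

```python
import math

A, B = 2.5, 1.5  # illustrative parameters a, b > 0 (assumption)


def beta_density_01(x):
    """Density of nu_{a,b} on (0, 1)."""
    beta_fn = math.gamma(A) * math.gamma(B) / math.gamma(A + B)
    return x ** (A - 1) * (1 - x) ** (B - 1) / beta_fn


def expectation(func, steps=100_000):
    """E[func(X)] for X ~ nu_{a,b}, via the midpoint rule on (0, 1)."""
    width = 1.0 / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * width
        total += func(x) * beta_density_01(x)
    return width * total


def stein_operator(f, df, x):
    """x (1 - x) f'(x) + [-(a + b) x + a] f(x)."""
    return x * (1 - x) * df(x) + (-(A + B) * x + A) * f(x)


# f = exp is its own derivative.
stein_expectation = expectation(lambda x: stein_operator(math.exp, math.exp, x))
mean = expectation(lambda x: x)
```

Up to quadrature error `stein_expectation` vanishes and `mean` equals $a/(a+b) = 0.625$.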

Let a test function $h : \mathbb{R} \to \mathbb{R}$ with $\int_{\mathbb{R}} |h(x)|\,d\nu_{a,b}(x) < \infty$ be given and let $h_1(y) :=
h\big( \frac{y+1}{2} \big)$. Consider the above constructed solution $g$ to the Stein equation for $\mu_{b-1,a-1}$
corresponding to the test function $h_1$. In the following let the real variables $x$ and
$y$ be related by $y := 2x - 1$ or $x := \frac{y+1}{2}$. Letting $f(x) := 2g(2x - 1)$ and noting
$f'(x) = 4g'(y)$ we obtain

$$\begin{aligned}
x(1 - x)\,f'(x) + \big[ -(a+b)\,x + a \big]\,f(x)
&= 4\,\frac{y+1}{2}\Big( 1 - \frac{y+1}{2} \Big)\,g'(y) + \big[ -(a+b)(y+1) + 2a \big]\,g(y) \\
&= (1 - y^2)\,g'(y) + \big[ -(a+b)\,y + a - b \big]\,g(y) \\
&= h_1(y) - \mu_{b-1,a-1}(h_1) = h(x) - \nu_{a,b}(h),
\end{aligned}$$

since for any admissible function $u : \mathbb{R} \to \mathbb{R}$ we have $\nu_{a,b}(u) = \mu_{b-1,a-1}(u_1)$, where
$u_1(y) := u\big( \frac{y+1}{2} \big)$. Hence, $f$ is a solution to (41). Thus, we immediately get bounds on
the solutions of the Stein equation for $\nu_{a,b}$ from our above developed theory. In the
following, we will always denote by $f_h$ the Stein solution to (41) which is constructed
in the explained way.

Proposition 3.8. Let $h : \mathbb{R} \to \mathbb{R}$ be Borel-measurable with $\int_{\mathbb{R}} |h(x)|\,d\nu_{a,b}(x) < \infty$.

(a) If $h$ is bounded, then $\|f_h\|_\infty \le \|h - \nu_{a,b}(h)\|_\infty \max\left( \frac{B(a,b)}{2\,m^a (1 - m)^b},\ \frac{1}{a},\ \frac{1}{b} \right)$, where
$m$ is a median for $\nu_{a,b}$.

(b) If $h$ is Lipschitz, then $\|f_h\|_\infty \le \frac{1}{a+b}\,\|h'\|_\infty$ and $\|f_h'\|_\infty \le C_1 \|h'\|_\infty$, where $C_1$ only
depends on $a$ and $b$.

(c) If $h$ is twice differentiable with bounded first and second derivative, then
$\|f_h''\|_\infty \le C_2\big( \|h'\|_\infty + \|h''\|_\infty \big)$, where $C_2$ only depends on $a$ and $b$.

Proof. Claim (a) follows from Proposition 2.25. Since $f_h(x) = 2 g_{h_1}(2x - 1)$ for all
$x \in \mathbb{R}$, (b) and (c) follow from Propositions 3.4 and 3.6. □

Now, let $V, V'$ be identically distributed, real-valued random variables on a
common probability space $(\Omega, \mathcal{A}, P)$. For the approximation of $\mathcal{L}(V)$ by $\nu_{a,b}$ the general
regression property from Section 2 is

$$E[V' - V \mid V] = \lambda\,(a+b)\Big( \frac{a}{a+b} - V \Big) + R, \tag{42}$$

where, again, $\lambda > 0$ is constant and $R$ is a hopefully small remainder term. For
the distribution $\nu_{a,b}$ Theorem 3.7 becomes the following:

Theorem 3.9. Let $V, V'$ be identically distributed, real-valued random variables on
a common probability space $(\Omega, \mathcal{A}, P)$ satisfying equation (42). Then, for each twice
differentiable function $h : \mathbb{R} \to \mathbb{R}$ with bounded first and second derivative and with
$E[|h(V)|] < \infty$ we have the bound

$$\begin{aligned}
\big| E[h(V)] - \nu_{a,b}(h) \big|
&\le C_1 \|h'\|_\infty\, E\left| V(1 - V) - \frac{1}{2\lambda} E[(V' - V)^2 \mid V] \right| \\
&\quad + \frac{C_2\big( \|h'\|_\infty + \|h''\|_\infty \big)}{6\lambda}\, E\big[ |V' - V|^3 \big] + \frac{\|h'\|_\infty}{(a+b)\,\lambda}\, E[|R|],
\end{aligned}$$

where the constants $C_1$ and $C_2$ are those from Proposition 3.8.

Proof. The assertion is clear from Propositions 2.31 and 3.8, since $f_h$ is a solution
to Stein's equation (41). □



4. Application to the Polya urn model 

In this section we prove a quantitative version of the fact that the relative
number of drawn red balls in a Polya urn model converges in distribution to a suitable
Beta distribution, if the number of total drawings tends to infinity. This model will
serve as an application of our Stein method for the Beta distribution, as developed
in Section 3. We start by introducing the stochastic model:

Imagine an urn containing at the beginning $r$ red and $w$ white balls and fix an
integer $c > 0$. At each time point $n \in \mathbb{N}$ a ball is drawn at random from the urn,
its color is noted and this ball together with $c$ further balls of the same color is
returned to the urn. For each $n \in \mathbb{N}$ let $X_n$ be the indicator random variable of the
event that the $n$-th drawn ball is a red one. Then $S_n := \sum_{j=1}^n X_j$ denotes the total
number of drawn red balls among the first $n$ drawings. It is a well-known fact from
elementary probability theory that for each $n \in \mathbb{N}$ and all $x_1, \ldots, x_n \in \{0, 1\}$ we have

$$P(X_1 = x_1, \ldots, X_n = x_n) = \frac{\prod_{i=0}^{k-1} (r + ci)\ \prod_{j=0}^{n-k-1} (w + cj)}{\prod_{i=0}^{n-1} (r + w + ci)},$$

where $k := \sum_{j=1}^n x_j$. This shows in particular that the sequence $(X_j)_{j \in \mathbb{N}}$ is
exchangeable.

It now follows that for each $k = 0, \ldots, n$ we have

$$P(S_n = k) = \binom{n}{k}\,\frac{\prod_{i=0}^{k-1} (r + ci)\ \prod_{j=0}^{n-k-1} (w + cj)}{\prod_{i=0}^{n-1} (r + w + ci)}$$

or, with $a := \frac{r}{c}$ and $b := \frac{w}{c}$,

$$P(S_n = k) = \frac{\binom{-a}{k}\binom{-b}{n-k}}{\binom{-a-b}{n}}.$$

The distribution of $S_n$ is usually referred to as the Polya distribution with
parameters $n$, $a$ and $b$. It is a well-known fact that the distribution of $\frac{1}{n} S_n$ converges
weakly to the distribution $\nu_{a,b}$ as $n$ goes to infinity, where the Beta distribution $\nu_{a,b}$
was defined in Section 3. A convenient way to prove this weak convergence result is
to use the formula

$$P(S_n = k) = \int_0^1 b(k; n, p)\,d\nu_{a,b}(p), \tag{43}$$

together with the weak law of large numbers for Bernoulli random variables to
deal with the binomial probabilities $b(k; n, p) = \binom{n}{k} p^k (1 - p)^{n-k}$.

Formula (43) can be proved by a straightforward computation using the relations
$B(a+1, b) = \frac{a}{a+b} B(a, b)$ and $B(a, b) = B(b, a)$ for the Beta function, where $a, b > 0$,
and can also be viewed as a consequence of a special instance of de Finetti's
representation theorem for infinite exchangeable sequences. Note, however, that one
generally does not know the corresponding mixing measure from de Finetti's
theorem and hence identity (43) is not a direct consequence of this theorem.
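
Formula (43) can be illustrated numerically: integrating the binomial probabilities against $\nu_{a,b}$ gives $\binom{n}{k} B(a+k, b+n-k)/B(a,b)$, which should coincide with the product formula for $P(S_n = k)$. The sketch below compares both expressions; the urn parameters and the function names are assumptions chosen only for illustration.

```python
from fractions import Fraction
from math import comb, exp, lgamma


def polya_pmf(n, k, r, w, c):
    """P(S_n = k) from the product formula of the urn model (exact)."""
    numerator = Fraction(1)
    for i in range(k):
        numerator *= r + c * i
    for j in range(n - k):
        numerator *= w + c * j
    denominator = Fraction(1)
    for i in range(n):
        denominator *= r + w + c * i
    return comb(n, k) * numerator / denominator


def log_beta(a, b):
    """Logarithm of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def beta_mixture_pmf(n, k, a, b):
    """Right hand side of (43): binom(n, k) * B(a + k, b + n - k) / B(a, b)."""
    return comb(n, k) * exp(log_beta(a + k, b + n - k) - log_beta(a, b))


# Illustrative urn (assumption): r = 3 red, w = 2 white, c = 2 added balls.
r, w, c, n = 3, 2, 2, 10
a, b = r / c, w / c
max_gap = max(
    abs(float(polya_pmf(n, k, r, w, c)) - beta_mixture_pmf(n, k, a, b))
    for k in range(n + 1)
)
total = sum(polya_pmf(n, k, r, w, c) for k in range(n + 1))
```

Here `max_gap` should be at the level of floating-point rounding, and `total` equals 1 exactly because the product formula is evaluated with rational arithmetic.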

In what follows, we present a proof of the above distributional convergence result
by Stein's method and, as usual, also derive a rate of convergence. We will usually
suppress the time index n and let $V := V_n := \frac{1}{n} S_n$ denote the random variable of
interest. For the construction of the exchangeable pair, we use the well-known Gibbs
sampling procedure with the slight simplification that, due to the exchangeability of
$X_1, \dots, X_n$, we need not choose at random the index of the summand of $S_n$ which
is to be replaced. Instead, we will always replace $X_n$ by $X_n'$, which is constructed
as follows:

Observe $X_1 = x_1, \dots, X_n = x_n$ and construct $X_n'$ according to the distribution
$\mathcal{L}(X_n \mid X_1 = x_1, \dots, X_{n-1} = x_{n-1})$. Then, letting $V' := V - \frac{1}{n} X_n + \frac{1}{n} X_n'$, the
pair $(V, V')$ is exchangeable. In order to use Stein's method of exchangeable pairs, we
need to establish a suitable regression property. This is the content of the following
proposition.

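Proposition 4.1. The exchangeable pair $(V, V')$ satisfies the regression property

\[ E[V' - V \mid V] = \frac{a+b}{n(a+b+n-1)} \Bigl( \frac{a}{a+b} - V \Bigr) = \lambda\, \gamma_{a,b}(V), \]

where $\gamma_{a,b}(x) = (a+b)\bigl(\frac{a}{a+b} - x\bigr)$ and $\lambda = \lambda_n = \frac{1}{n(a+b+n-1)}$.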

Proof. We have $V' - V = \frac{1}{n}(X_n' - X_n)$ and by exchangeability of $X_1, \dots, X_n$ it clearly
holds that $E[X_n \mid V] = E[X_n \mid S_n] = \frac{1}{n} S_n = V$. Also, by the definition of $X_n'$ and since
$X_n$ only assumes the values 0 and 1, we have for any $x_1, \dots, x_n \in \{0, 1\}$

\[ E[X_n' \mid X_1 = x_1, \dots, X_n = x_n] = E[X_n \mid X_1 = x_1, \dots, X_{n-1} = x_{n-1}] = P(X_n = 1 \mid X_1 = x_1, \dots, X_{n-1} = x_{n-1}) = \frac{r + c \sum_{j=1}^{n-1} x_j}{r + w + c(n-1)} \]

and hence,

\[ E[X_n' \mid X_1, \dots, X_n] = \frac{r + c \sum_{j=1}^{n-1} X_j}{r + w + c(n-1)} = \frac{r + cnV - cX_n}{r + w + c(n-1)}. \]

Thus, since $\sigma(V) \subseteq \sigma(X_1, \dots, X_n)$, we obtain

\[ E[X_n' \mid V] = E\bigl[ E[X_n' \mid X_1, \dots, X_n] \,\big|\, V \bigr] = \frac{r + cnV - cV}{r + w + c(n-1)} = \frac{r + c(n-1)V}{r + w + c(n-1)} = \frac{a + (n-1)V}{a + b + n - 1}. \]

Finally, we have

\[ E[V' - V \mid V] = \frac{1}{n} E[X_n' \mid V] - \frac{1}{n} E[X_n \mid V] = \frac{1}{n} \cdot \frac{a + (n-1)V}{a + b + n - 1} - \frac{1}{n} V = \frac{a - (a+b)V}{n(a+b+n-1)} = \frac{a+b}{n(a+b+n-1)} \Bigl( \frac{a}{a+b} - V \Bigr), \]

as was to be shown.
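The regression property can be verified exactly for small n by enumerating all colour sequences. The following is a sketch under the notation a = r/c, b = w/c; the function names are hypothetical, and the conditional law of $X_n'$ is taken to be the one described in the construction of the pair, namely a red draw with probability $(a + m)/(a + b + n - 1)$ when m of the first n - 1 draws were red.

```python
from fractions import Fraction
from itertools import product


def seq_prob(xs, a, b):
    """Probability of the colour sequence xs in the urn with a = r/c, b = w/c."""
    prob, reds = Fraction(1), 0
    for i, x in enumerate(xs):
        p_red = (a + reds) / (a + b + i)  # chance of red at draw i + 1
        prob *= p_red if x else 1 - p_red
        reds += x
    return prob


def cond_drift(n, a, b):
    """E[V' - V | S_n = k] for k = 0, ..., n, by exhaustive enumeration."""
    num = [Fraction(0)] * (n + 1)
    den = [Fraction(0)] * (n + 1)
    for xs in product((0, 1), repeat=n):
        k, pr = sum(xs), seq_prob(xs, a, b)
        m = k - xs[-1]                           # reds among the first n - 1 draws
        e_xprime = (a + m) / (a + b + n - 1)     # E[X_n' | X_1, ..., X_n]
        num[k] += pr * (e_xprime - xs[-1]) / n   # contribution to E[(X_n' - X_n)/n]
        den[k] += pr
    return [num[k] / den[k] for k in range(n + 1)]
```

Passing `a` and `b` as `Fraction` objects keeps all arithmetic exact, so the output can be compared with $\lambda(a - (a+b)V)$ at $V = k/n$ without rounding issues.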

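□

Next, we will compute the quantity $E[(V' - V)^2 \mid V]$.

Proposition 4.2. We have for the above constructed exchangeable pair $(V, V')$

\[ E[(V' - V)^2 \mid V] = \frac{1}{n^2(a+b+n-1)} \bigl( (2n + b - a)V - 2nV^2 + a \bigr) \]

and hence

\[ \frac{1}{2\lambda} E[(V' - V)^2 \mid V] = V(1 - V) + \frac{b-a}{2n} V + \frac{a}{2n}. \]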



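Proof. From the general theory of Gibbs sampling (see the author's PhD thesis, to
appear) it is known that

\[ E[(V' - V)^2 \mid V] = \frac{1}{n^2} \Bigl( E[X_n^2 \mid V] + E\bigl[ E[X_n^2 \mid X_1, \dots, X_{n-1}] \,\big|\, V \bigr] - 2 E\bigl[ X_n E[X_n \mid X_1, \dots, X_{n-1}] \,\big|\, V \bigr] \Bigr). \]

Since $X_n^2 = X_n$ we have from the proof of Proposition 4.1 that

\[ E[X_n^2 \mid X_1, \dots, X_{n-1}] = E[X_n \mid X_1, \dots, X_{n-1}] = \frac{a + nV - X_n}{a + b + n - 1} \]

and hence

\[ E\bigl[ E[X_n^2 \mid X_1, \dots, X_{n-1}] \,\big|\, V \bigr] = \frac{a + (n-1)V}{a + b + n - 1}, \]

where we have used $E[X_n \mid V] = V$ again. Finally, we compute

\[ E\bigl[ X_n E[X_n \mid X_1, \dots, X_{n-1}] \,\big|\, V \bigr] = E\Bigl[ \frac{aX_n + nVX_n - X_n^2}{a + b + n - 1} \,\Big|\, V \Bigr] = \frac{aV + nV^2 - V}{a + b + n - 1} = \frac{(a-1)V + nV^2}{a + b + n - 1}. \]

Putting pieces together, we eventually obtain

\[ E[(V' - V)^2 \mid V] = \frac{1}{n^2} \Bigl( V + \frac{a + (n-1)V}{a + b + n - 1} - 2\, \frac{(a-1)V + nV^2}{a + b + n - 1} \Bigr) = \frac{1}{n^2(a+b+n-1)} \bigl( (2n + b - a)V - 2nV^2 + a \bigr). \]

The last assertion easily follows from this and from $\lambda = \frac{1}{n(a+b+n-1)}$. □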
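The conditional second moment can be checked in the same exhaustive way. This is again only a sketch with hypothetical function names; it uses that, given $X_1, \dots, X_n$, the squared difference $(X_n' - X_n)^2$ equals 1 exactly when $X_n'$ and $X_n$ differ.

```python
from fractions import Fraction
from itertools import product


def seq_prob(xs, a, b):
    """Probability of the colour sequence xs in the urn with a = r/c, b = w/c."""
    prob, reds = Fraction(1), 0
    for i, x in enumerate(xs):
        p_red = (a + reds) / (a + b + i)
        prob *= p_red if x else 1 - p_red
        reds += x
    return prob


def cond_second_moment(n, a, b):
    """E[(V' - V)^2 | S_n = k] for k = 0, ..., n, by exhaustive enumeration."""
    num = [Fraction(0)] * (n + 1)
    den = [Fraction(0)] * (n + 1)
    for xs in product((0, 1), repeat=n):
        k, pr = sum(xs), seq_prob(xs, a, b)
        p1 = (a + k - xs[-1]) / (a + b + n - 1)      # P(X_n' = 1 | X_1, ..., X_{n-1})
        sq = p1 * (1 - xs[-1]) + (1 - p1) * xs[-1]   # P(X_n' != X_n | X_1, ..., X_n)
        num[k] += pr * sq / n**2
        den[k] += pr
    return [num[k] / den[k] for k in range(n + 1)]
```

For n = 1 the result reduces to a/(a+b) at k = 0 and b/(a+b) at k = 1, which is a quick consistency check by hand.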



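Recall that for the distribution $\nu_{a,b}$ we have $\eta(x) := \eta_{a,b}(x) = x(1 - x)$ and hence,
we obtain from Proposition 4.2 that

\[ E\Bigl| \eta(V) - \frac{1}{2\lambda} E[(V' - V)^2 \mid V] \Bigr| = E\Bigl| \frac{a-b}{2n} V - \frac{a}{2n} \Bigr| \le \frac{|a-b| + a}{2n} \tag{44} \]

since $|V| \le 1$. Similarly, since $|V' - V| = \frac{1}{n} |X_n' - X_n| \le \frac{1}{n}$ we have

\[ \frac{1}{6\lambda} E|V' - V|^3 \le \frac{n(a+b+n-1)}{6} \cdot \frac{1}{n^3} = \frac{a+b+n-1}{6n^2} = \frac{1}{6n} + \frac{a+b-1}{6n^2} = O\Bigl( \frac{1}{n} \Bigr). \tag{45} \]

From Theorem 3.9 we can now conclude the following result.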

Theorem 4.3. For each twice differentiable function $h : \mathbb{R} \to \mathbb{R}$ with bounded first
and second derivative we have



\E[h{V)] - 

< (Ci\\h' 



a-b\+a , ^ ^1,^, ^,,,1 ^^1 , a + 6-l^\l 



CO — 2 + ^^(ll^'ll- + (e + ^ 



O 



n 



with the constants $C_1$ and $C_2$ from Proposition 3.8.
Proof. Since V assumes only values in [0,1], the finiteness condition from
Theorem 3.9 is trivially met. The assertion now follows immediately from Theorem
3.9, (44) and (45). □
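The order 1/n can be observed exactly for one concrete test function. The sketch below is an illustration under assumptions, not part of the proof: it takes h(x) = x^2 on [0, 1], assumes that $\nu_{a,b}$ is the usual Beta distribution on [0, 1] with second moment $a(a+1)/((a+b)(a+b+1))$, and uses hypothetical function names.

```python
from fractions import Fraction
from math import comb


def polya_pmf(n, k, a, b):
    """P(S_n = k) in product form, with a = r/c and b = w/c."""
    num = Fraction(1)
    for i in range(k):
        num *= a + i
    for j in range(n - k):
        num *= b + j
    den = Fraction(1)
    for i in range(n):
        den *= a + b + i
    return comb(n, k) * num / den


def second_moment_gap(n, a, b):
    """E[V^2] minus the second moment of Beta(a, b), computed exactly."""
    ev2 = sum(polya_pmf(n, k, a, b) * Fraction(k, n) ** 2 for k in range(n + 1))
    beta_m2 = a * (a + 1) / ((a + b) * (a + b + 1))
    return ev2 - beta_m2
```

With `a` and `b` given as `Fraction` objects, the scaled gap `n * second_moment_gap(n, a, b)` does not depend on n, so for this test function the error is of exact order 1/n.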

Remark 4.4. (a) In [GR12] the authors use a different technique within Stein's
method for the Beta distributions, which compares the Stein characterization
of the target distribution with that of the approximating discrete distribution,
to prove that, in the special case c = 1, the convergence rate of order $n^{-1}$ from
Theorem 4.3 even holds in the Wasserstein distance, and they compute an explicit
constant in the bound. They also show that this rate of convergence is optimal.
Using their technique and the bounds from Proposition 3.8, one can easily see
that the rate of order $n^{-1}$ in the Wasserstein distance also holds in the case $c \ge 2$.
However, in order to obtain an explicit constant, some further work has to be
done to bound the constant $C_1$ from Proposition 3.8 in the case that one of the
values a, b is strictly smaller than one.

(b) In [FG] the authors use the zero bias coupling within Stein's method for normal
approximation to prove bounds on the distance of a normalized version of the
quantity V to the standard normal distribution. In particular, they show that
a CLT holds whenever the parameters n, a and b tend to infinity in a suitable
fashion.

5. Appendix 

In this section we provide the proofs of some of the results from Sections 2 and 3
and state and prove some further auxiliary results, which are only used within proofs.



5.1. A general version of de l'Hôpital's rule. The following result justifies all
our calculations which invoke de l'Hôpital's rule. Its proof is omitted for reasons
of space, but will be given in the author's PhD thesis.

Theorem 5.1 (Generalization of one of de l'Hôpital's rules). Let a < b be extended
real numbers and let $f, g : (a, b) \to \mathbb{R}$ be functions with the following properties:

(i) If $a < a' < b' < b$, then both f and g are absolutely continuous on $[a', b']$.

(ii) We have $g'(x) \ge 0$ for $\lambda$-almost all $x \in (a, b)$ and with $E := \{x \in (a, b) : g'(x) \ne 0\}$
it holds that $\lim_{n \to \infty} \frac{f'(x_n)}{g'(x_n)} = s \in \mathbb{R}$ for each sequence $(x_n)_{n \in \mathbb{N}} \subseteq E$
converging to a.

If $\lim_{x \searrow a} f(x) = \lim_{x \searrow a} g(x) = 0$, then $g(x) \ne 0$ for all $x \in (a, b)$ and

\[ \lim_{x \searrow a} \frac{f(x)}{g(x)} = s. \]

The same conclusion holds if $g'(x) \le 0$ for almost all $x \in (a, b)$, and an analogous
result is true for $\lim_{x \nearrow b}$.

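5.2. Proofs from Section 2.

Proof of Proposition 2.10. With $\tilde h = h - \mu(h)$, since $I = \eta \cdot p$, we have for $a < x < b$

\[ |g_h(x)| = \frac{\bigl| \int_a^x \tilde h(t) p(t)\,dt \bigr|}{|p(x)\eta(x)|} = \frac{\bigl| \int_a^x \tilde h(t) p(t)\,dt \bigr|}{I(x)} \le \|\tilde h\|_\infty \frac{F(x)}{I(x)}. \]

Let $M : (a, b) \to \mathbb{R}$ be given by $M(x) := \frac{F(x)}{I(x)}$. By l'Hôpital's rule (see Theorem 5.1)
we have

\[ \lim_{x \searrow a} M(x) = \lim_{x \searrow a} \frac{p(x)}{\gamma(x) p(x)} = \lim_{x \searrow a} \frac{1}{\gamma(x)} = \frac{1}{\lim_{x \searrow a} \gamma(x)}, \]

which exists in $[0, \infty)$ by Condition 2.2. Here, we used the convention $\frac{1}{\infty} = 0$.
Moreover,

\[ \lim_{x \nearrow b} M(x) = \frac{1}{\lim_{x \nearrow b} I(x)} = +\infty \]

again by Condition 2.2 and by Proposition 2.4. Furthermore, we have

\[ M'(x) = \frac{p(x) I(x) - F(x) \gamma(x) p(x)}{I(x)^2} = \frac{p(x) \bigl( I(x) - \gamma(x) F(x) \bigr)}{I(x)^2} > 0 \]

for each $x \in (a, b)$ since, by the positivity of p and because $\gamma$ is strictly decreasing,

\[ I(x) = \int_a^x \gamma(t) p(t)\,dt > \gamma(x) \int_a^x p(t)\,dt = \gamma(x) F(x). \]

Hence, M is strictly increasing and thus for each $x \in (a, m]$:

\[ |g_h(x)| \le \|\tilde h\|_\infty \frac{F(m)}{I(m)} = \frac{\|h - \mu(h)\|_\infty}{2 I(m)}. \]

The same bound can be proved for $x \in (m, b)$ by using the representation

\[ g_h(x) = -\frac{1}{I(x)} \int_x^b (h(t) - \mu(h)) p(t)\,dt \]

and the fact that also $1 - F(m) = \frac{1}{2}$.

□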

The following two well-known lemmas will be needed for the proof of Proposition
2.14. Their proofs are included only for reasons of completeness.
Lemma 5.2. Let $-\infty \le a < b \le \infty$ and let $\mu$ be a probability measure (not necessarily
absolutely continuous with respect to $\lambda$) with $\operatorname{supp}(\mu) \subseteq (a, b)$. Let F be the
distribution function corresponding to $\mu$ and suppose that $\int_{\mathbb{R}} |x|\,d\mu(x) < \infty$. Then,
for each $x \in (a, b)$ we have

(a) $\int_a^x F(t)\,dt = x F(x) - \int_{(a,x]} s\,d\mu(s)$

(b) $\int_x^b (1 - F(t))\,dt = \int_{(x,b)} s\,d\mu(s) - x(1 - F(x))$
Proof. By Fubini's theorem we have

\[ \begin{aligned}
\int_a^x F(t)\,dt &= \int_{-\infty}^x F(t)\,dt = \int_{(-\infty,x]} F(t)\,dt = \int_{(-\infty,x]} \Bigl( \int_{(-\infty,t]} d\mu(s) \Bigr) dt \\
&= \int_{(-\infty,x]} \Bigl( \int_{[s,x]} dt \Bigr) d\mu(s) = \int_{(-\infty,x]} (x - s)\,d\mu(s) \\
&= x F(x) - \int_{(-\infty,x]} s\,d\mu(s) \\
&= x F(x) - \int_{(a,x]} s\,d\mu(s).
\end{aligned} \]

This proves (a). Similarly, we have

\[ \begin{aligned}
\int_x^b (1 - F(t))\,dt &= \int_x^\infty (1 - F(t))\,dt = \int_{(x,\infty)} \mu((t, \infty))\,dt \\
&= \int_{(x,\infty)} \int_{(t,\infty)} d\mu(s)\,dt = \int_{(x,\infty)} \int_{(x,s)} dt\,d\mu(s) \\
&= \int_{(x,\infty)} (s - x)\,d\mu(s) \\
&= \int_{(x,\infty)} s\,d\mu(s) - x(1 - F(x)) = \int_{(x,b)} s\,d\mu(s) - x(1 - F(x)),
\end{aligned} \]

proving (b). □
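Since the lemma does not require absolute continuity, both identities can be tested exactly on a purely atomic measure. The following sketch uses a hypothetical three-point measure on (a, b) = (0, 1) and integrates the step functions F and 1 - F piece by piece; all names and numbers are illustrative assumptions.

```python
from fractions import Fraction as Fr

# (location, mass) pairs of a discrete probability measure on (0, 1)
ATOMS = [(Fr(1, 5), Fr(1, 2)), (Fr(1, 2), Fr(1, 3)), (Fr(4, 5), Fr(1, 6))]


def F(t):
    """Distribution function of the atomic measure (right-continuous)."""
    return sum(m for s, m in ATOMS if s <= t)


def step_integral(f, lo, hi):
    """Exact integral over [lo, hi] of a function that is constant
    between consecutive atoms and right-continuous at them."""
    cuts = sorted({lo, hi, *(s for s, _ in ATOMS if lo < s < hi)})
    return sum(f(l) * (r - l) for l, r in zip(cuts, cuts[1:]))


def lemma_a_sides(x, a=0):
    """Left and right hand side of identity (a)."""
    lhs = step_integral(F, a, x)
    rhs = x * F(x) - sum(s * m for s, m in ATOMS if a < s <= x)
    return lhs, rhs


def lemma_b_sides(x, b=1):
    """Left and right hand side of identity (b)."""
    lhs = step_integral(lambda t: 1 - F(t), x, b)
    rhs = sum(s * m for s, m in ATOMS if x < s < b) - x * (1 - F(x))
    return lhs, rhs
```

For x = 3/5, for instance, both sides of (a) evaluate to 7/30 and both sides of (b) to 1/30.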

Lemma 5.3. Let $-\infty \le a < b \le \infty$ and let $\mu$ be a probability measure (not necessarily
absolutely continuous with respect to $\lambda$) with $\operatorname{supp}(\mu) \subseteq (a, b)$. Let F be the
distribution function corresponding to $\mu$, let $Z \sim \mu$ and let $h : (a, b) \to \mathbb{R}$ be Lipschitz
continuous with $E[|h(Z)|] < \infty$. Then the following assertions hold true:

(a) For each y we have

\[ h(y) - \mu(h) = \int_{-\infty}^y F(s) h'(s)\,ds - \int_y^\infty (1 - F(s)) h'(s)\,ds. \]

(b) For each $x \in (a, b)$ we have

\[ \int_{(a,x]} (h(y) - \mu(h))\,d\mu(y) = -(1 - F(x)) \int_a^x F(s) h'(s)\,ds - F(x) \int_x^b (1 - F(s)) h'(s)\,ds. \]



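Proof. Since $\mu$ is a probability measure we have by the fundamental theorem of
calculus for Lebesgue integration and by Fubini's theorem

\[ \begin{aligned}
h(y) - \mu(h) &= \int_{\mathbb{R}} (h(y) - h(t))\,d\mu(t) = \int_{\mathbb{R}} \Bigl( \int_t^y h'(s)\,ds \Bigr) d\mu(t) \\
&= \int_{(-\infty,y]} \Bigl( \int_{[t,y]} h'(s)\,ds \Bigr) d\mu(t) - \int_{(y,\infty)} \Bigl( \int_{[y,t)} h'(s)\,ds \Bigr) d\mu(t) \\
&= \int_{(-\infty,y]} \Bigl( \int_{(-\infty,s]} d\mu(t) \Bigr) h'(s)\,ds - \int_{(y,\infty)} \Bigl( \int_{(s,\infty)} d\mu(t) \Bigr) h'(s)\,ds \\
&= \int_{(-\infty,y]} F(s) h'(s)\,ds - \int_{(y,\infty)} (1 - F(s)) h'(s)\,ds \\
&= \int_{-\infty}^y F(s) h'(s)\,ds - \int_y^\infty (1 - F(s)) h'(s)\,ds.
\end{aligned} \]

This proves (a). As to (b), we have using (a) and its proof

\[ \begin{aligned}
\int_{(a,x]} (h(y) - \mu(h))\,d\mu(y) &= \int_{(-\infty,x]} (h(y) - \mu(h))\,d\mu(y) \\
&= \int_{(-\infty,x]} \Bigl( \int_{(-\infty,y]} F(s) h'(s)\,ds - \int_{(y,\infty)} (1 - F(s)) h'(s)\,ds \Bigr) d\mu(y) \\
&= \int_{(-\infty,x]} \int_{(-\infty,y)} F(s) h'(s)\,ds\,d\mu(y) - \int_{(-\infty,x]} \int_{(y,\infty)} (1 - F(s)) h'(s)\,ds\,d\mu(y) \\
&= \int_{(-\infty,x)} F(s) h'(s) \Bigl( \int_{(s,x]} d\mu(y) \Bigr) ds - \int_{\mathbb{R}} (1 - F(s)) h'(s) \Bigl( \int_{(-\infty,s) \cap (-\infty,x]} d\mu(y) \Bigr) ds \\
&= \int_{(-\infty,x)} F(s) h'(s) (F(x) - F(s))\,ds - \int_{(-\infty,x]} F(s)(1 - F(s)) h'(s)\,ds - \int_{(x,\infty)} F(x)(1 - F(s)) h'(s)\,ds \\
&= \int_{-\infty}^x \bigl( F(s) F(x) h'(s) - F(s)^2 h'(s) + F(s)^2 h'(s) - F(s) h'(s) \bigr)\,ds - F(x) \int_x^\infty (1 - F(s)) h'(s)\,ds \\
&= -(1 - F(x)) \int_{-\infty}^x F(s) h'(s)\,ds - F(x) \int_x^\infty (1 - F(s)) h'(s)\,ds \\
&= -(1 - F(x)) \int_a^x F(s) h'(s)\,ds - F(x) \int_x^b (1 - F(s)) h'(s)\,ds,
\end{aligned} \]

as claimed.

□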



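Proof of Proposition 2.14. First, we prove (a). Recall the representation

\[ g_h(x) = \frac{1}{I(x)} \int_a^x (h(y) - \mu(h)) p(y)\,dy = \frac{1}{I(x)} \int_{(a,x]} (h(y) - \mu(h))\,d\mu(y). \]

By Lemmas 5.3 and 5.2 we thus obtain that

\[ \begin{aligned}
|I(x) g_h(x)| &= \Bigl| -(1 - F(x)) \int_a^x F(s) h'(s)\,ds - F(x) \int_x^b (1 - F(s)) h'(s)\,ds \Bigr| \\
&\le \|h'\|_\infty \Bigl( (1 - F(x)) \int_a^x F(s)\,ds + F(x) \int_x^b (1 - F(s))\,ds \Bigr) \\
&= \|h'\|_\infty \Bigl( (1 - F(x)) \Bigl( x F(x) - \int_a^x s p(s)\,ds \Bigr) + F(x) \Bigl( -x(1 - F(x)) + \int_x^b s p(s)\,ds \Bigr) \Bigr) \\
&= \|h'\|_\infty \Bigl( -\int_a^x s p(s)\,ds + F(x) \Bigl( \int_a^x s p(s)\,ds + \int_x^b s p(s)\,ds \Bigr) \Bigr) \\
&= \|h'\|_\infty \Bigl( F(x) E[Z] - \int_a^x y p(y)\,dy \Bigr),
\end{aligned} \]

implying (a).

Now, we turn to the proof of (b). By Stein's equation (11) we obtain for $x \in (a, b)$

\[ g_h'(x) = \frac{1}{\eta(x)} \bigl( \tilde h(x) - \gamma(x) g_h(x) \bigr), \tag{46} \]

where we have again written $\tilde h = h - \mu(h)$. Using Lemma 5.3 again, we obtain

\[ \begin{aligned}
g_h'(x) &= \frac{1}{\eta(x)} \Bigl( \int_a^x F(s) h'(s)\,ds \Bigl( 1 + \frac{\gamma(x)(1 - F(x))}{\eta(x) p(x)} \Bigr) + \int_x^b (1 - F(s)) h'(s)\,ds \Bigl( -1 + \frac{\gamma(x) F(x)}{\eta(x) p(x)} \Bigr) \Bigr) \\
&= \int_a^x F(s) h'(s)\,ds\, \frac{\eta(x) p(x) + \gamma(x)(1 - F(x))}{\eta(x)^2 p(x)} - \int_x^b (1 - F(s)) h'(s)\,ds\, \frac{\eta(x) p(x) - \gamma(x) F(x)}{\eta(x)^2 p(x)}.
\end{aligned} \tag{47} \]

Now consider the functions $H, G : (a, b) \to \mathbb{R}$ with $H(x) = I(x) - \gamma(x) F(x) =
\eta(x) p(x) - \gamma(x) F(x)$ and $G(x) = H(x) + \gamma(x) = \eta(x) p(x) + \gamma(x)(1 - F(x))$. It
was already observed in the proof of Proposition 2.10 that H is positive on (a, b).
Similarly we prove the positivity of G on (a, b): For x in (a, b) we have, since p is
positive and $\gamma$ is strictly decreasing,

\[ G(x) = I(x) + \gamma(x)(1 - F(x)) = -\int_x^b \gamma(t) p(t)\,dt + \gamma(x)(1 - F(x)) > -\gamma(x)(1 - F(x)) + \gamma(x)(1 - F(x)) = 0. \]

By (47) we can thus bound

\[ |g_h'(x)| \le \|h'\|_\infty \Bigl( \int_a^x F(s)\,ds\, \frac{G(x)}{\eta(x)^2 p(x)} + \int_x^b (1 - F(s))\,ds\, \frac{H(x)}{\eta(x)^2 p(x)} \Bigr), \tag{48} \]

which reduces to the bound asserted in (b).

□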



Proof of Proposition 2.25. The bound on $|g_h(x)|$ for $x \in (a, b)$ has already been proved
in Proposition 2.10. Let $x \in (-\infty, a)$. Then we have by the negativity of $q_l$, which
follows from Proposition 2.23,



\9h{x)\ 



exp(F;(a;)) 



< \\h\ 



\h\ 



'exp(F,(x)) 



We want to show, that this is bounded from above by . ^^^^^ 



exp{Fi{x)) 

°°exp(Fi(x)) 

J^^^. To this end, we define 



the function D(x) :- 



exp(F;(a;)) 
7(a) 



— Qi{x), X G (—00, a), and show that D{x) > 0. By 



Condition 12. 191 and Proposition 12.231 we have limx/^a F>{x) = and furthermore 



D'(x) 



?Mexp(FKx)) 



Qiix) = qi{x) 



j{x) 



1 < 0, 



7(a) V7(a) 

since $q_l(x) < 0$ and $\gamma(x) > \gamma(a)$ by Condition 2.18. Thus, D is strictly decreasing
and hence $D(x) > 0$ for each $x \in (-\infty, a)$. This proves the desired bound for

X G (— c>o, a). Similarly one proves that |5'/i(a;)| < — ■'^^^ for each x G (&, 00). □ 



Next, we will state a lemma which replaces Lemma 5.2 outside the support (a, b).
Lemma 5.4. (a) For each $x \in (-\infty, a)$ we have $\int_x^a Q_l(t)\,dt = -x Q_l(x) + \int_a^x t q_l(t)\,dt$
and $I_l(x) < \gamma(x) Q_l(x)$.
(b) For each $x \in (b, \infty)$ we have $\int_b^x Q_r(t)\,dt = x Q_r(x) - \int_b^x t q_r(t)\,dt$ and
$I_r(x) < \gamma(x) Q_r(x)$.

Proof. By Fubini's theorem we have

\[ \begin{aligned}
\int_x^a Q_l(t)\,dt &= \int_x^a \Bigl( \int_a^t q_l(s)\,ds \Bigr) dt = -\int_x^a \Bigl( \int_t^a q_l(s)\,ds \Bigr) dt \\
&= -\int_x^a \Bigl( \int_x^s dt \Bigr) q_l(s)\,ds = -\int_x^a (s - x) q_l(s)\,ds \\
&= -x Q_l(x) + \int_a^x s q_l(s)\,ds,
\end{aligned} \]

which proves the first part of (a). The second claim of (a) follows from the
positivity of $-q_l$ on $(-\infty, a)$ and from the monotonicity of $\gamma$:

\[ \begin{aligned}
I_l(x) &= \int_a^x \gamma(t) q_l(t)\,dt = \int_x^a \gamma(t)(-q_l(t))\,dt \\
&< \gamma(x) \int_x^a (-q_l(t))\,dt = \gamma(x) \int_a^x q_l(t)\,dt \\
&= \gamma(x) Q_l(x).
\end{aligned} \]

The proof of (b) is similar but easier, and is therefore omitted. □

The next lemma replaces Lemma 5.3 outside of the support of $\mu$.

Lemma 5.5. (a) For each x G {—00, a) we have 

h{x) - fi{h) = - J^{1 - F{s))h'{s)ds = - h\s)ds - - F{s))h\s)ds and 

9h{x) = j^[Qi{x)-Qi{s))h\s)ds~Qi{x) j^{l-F{s))h\s)ds 

1 



Qi{s)h'{s)ds-Qi{x) / {l-F{s))h'{s)ds\. 
(b) For each $x \in (b, \infty)$ we have $h(x) - \mu(h) = \int_a^x F(s) h'(s)\,ds$ and

9h{x) = j^{Qr{x) j^F{s)h'{s)ds + h\s){Qr{x)-Qr{s))ds 

Qr{x) / F{s)h'{s)ds - / Qr{s)h'{s)ds ] . 



1 



Ir(x) 



r\-^J \ Ja Jb 



Proof. We only prove (a) since the proof of (b) is very similar. The first claim follows
from Lemma 5.3 (a) since $F(s) = 0$ for $s < a$ and $F(s) = 1$ for $s > b$. The second
claim follows from the first one and from Fubini's theorem by



9hix) 



h{x) Ja k[x) Ja \ Jy ' J 



Ky)qi{y)dy = —- \- [l - F{s))h\s)dsU{y)dy 



Ii{x 
1 



b 



{1-Fis))h'is)dsjqiiy)dy 

(paAs \ 

= yi- / {l-F(3)]h'(s){Q,(aA3)-Q,(x))d3 

= f h'{s){Qi{s) - Qi{x))ds + f {l-F{s))h'{s)ds{Qi{a)-Qi{x)) 

^l\X) Jx J-l\X) Ja 

= [{Qii^) - Qi{s))h'{s)ds - Qi{x) j\l - F{s))h'{s)ds 

This is the first representation for gh{x) in the assertion. The second one follows, 
since 1 — F{s) = 1 for s < a and hence 

-/ {Qi{x)~Qi{s))h'{s)ds= [ Qi{s)h'{s)ds-Qi{x) ! {1-F{s))h'{s)ds. 

J X J X J X 

□ 
Proof of Proposition 2.27. We only prove (a) and (c), since the proofs of (b) and (d)
are similar. To prove (a), we observe that by Lemma 5.5 we have

gh{x) = J^(^- ~ Qii^))h'{s)ds - Qi{x) j\l - F{s))h'{s)ds 

Since Qi is decreasing {Q[ = qi < 0) and positive on (— oo, a) this implies 



k(x)|<^^|^y {Qi{x)-Qiis))ds + Qi{x) j {1-F{s))ds 
By Lemma 15.21 and Lemma 15.41 (a) the right hand side equals 



^(a - x)Qi{x) + xQiix) - sqi{s)ds + Qi{x) {E[Z] - a) 
Qi{x)E[Z] - / sqi{s)ds 



Iix) 



which is the claimed bound. Now, we turn to the proof of (c) . By Stein's equation 
Ml) and Lemma [5.51 (a) we have for each x G (— oo, a): 



g'nix) = ^ - ^^g,{x) = /V - F{s))h'{s)ds 

r]{x) r]{x) ri[x) 



7(x) 



ri{x)Ii{x] 
-1 



b 



r]{x)Ii{x 
By Lemma 15.21 (a) we have 



Qi{s)h'{s)ds-Qi{x) / {1-F{s))h'{s)d^ 

J X 

l{x) ( Qi{s)h'{s)ds+ f {l-F{s))h'{s)ds(li{x)-Qi{x)-^{x) 



pb pb 

/ {1- F{s))ds = a-x+ {l-F{s))ds = a-x + E[Z]-a = E[Z]-x. 

J X J a 

Hence, by Lemma [5.41 (a) this implies 
7(x) [-xQiix) + f tqi{t)dt) + {E[Z] - x) {Qi{x)-i{x) - Ji(x)



-?7(x)/;(a;) 

which was to be shown. □ 
Proof of Corollary \2.28[ In this case, we have the relations 

(49) Ii{x) = c {E[Z]-t)qi{t)dt = c(Qi{x)E[Z]- tqi{t)dt\ 

J a J a 

Ir{x) = C {E[Z] - t)qr{t)dt = c(^Qr{x)E[Z] - tqr{t)dt^ . 

These together with Proposition 12.271 (a), (b) and Corollary 12.161 immediately 
imply (a). As to (b), by (I49p we have 



{E[Z]-x) [Qi{xh{x)-Iiix)^ 
{E[Z] - x) (cE[Z]Qi{x) - cxQi{x) - c{Qi{x)E[Z] - tqi{t)dt)^ 

c{E[Z] - x) [-xQi{x) + ^ tqi{t)dt^ 



'-^{x)[-xQi{x) + / tqi(t)dt 



This and Proposition 12.271 (c) imply claim (b). Assertion (c) may be proved simi- 
larly. □ 

Proof of Proposition \2. 29i For any x G (a, b) we have: 

9zix) = 7^/ {li-oo,4t) - PiZ < z))p{t)dt 

rxAz 



— (y p{t)dt-F{z)F{x: 



F{x Az)- F{z)F{x) 

proving the desired representation of Qz inside the interval (a, b). Now, for x G (a, b) 
let M{x) := and N(x) := Then we have 
and



N{x) = = 7(^(nx) + (l-F(x))7(x) 



I{x 

since G{x) = {I{x) + (1 — F{x))'~f{x)) is positive by Proposition 12.141 Thus, M 
is strictly increasing and N is strictly decreasing on {a,b). Since gz{x) = (1 — 
F{z))M{x) for X G (a, 2;] and gz{x) = F{z)N{x) for x G this implies, that 

^^Px€{a,b) 9zix) = Qziz) = ™pli6S the claimed representation of 

g'z{x) for x G (a, 6) \ {z}. Furthermore, by de I'Hopital's rule, we have 

\imgz{x) = (1 -F(z))limM(x) = (1 -F(2))lim ^^^^ 



x\a x\a x\a p(^x)'j(x) 

x\a 7(a) 

and 



— ui X] 

lira gz{x) = lim A^(x) = lim 



x/^b x/^b x/'b p(x)'j{x) 

x/'b 7(x) 7(6) 

Note, that these limits could also be derived from Proposition 12.221 
Next, consider x G {—00, a). For such an x we have 



k{x)J^ ^ ^ - ^^-^'^^ Ii{x) 

Moreover we have 
d Qi{x) qi{x)Ii{x) - -f{x)qi{x)Qi{x)



dx Ii{x) Ii{xy 
-qi{x" 



Iiix 



i2 



^{x)Qiix)-Ii{x)) >0 



by Lemma[531 Thus, Qz is increasing on (—00, a) and hence, again by de I'Hopital's 
rule, 

sup \gz{x)\ = sup gz{x) = {1 - F{z)) lim ^^^^^ 



""I' \^z\-^j\ -j^j^ ^z\-^j \^ ^ \~ J J T ( \ 

xe{-oo,a) ze(-oo,a) -^lyX) 

= (1 - F{z)) lim ■'^[^l ^ = (1 - F{z)) lim 



1 - m 

7(a) 

Since g'^ix) = (1 — -^(^))^^^ we have also derived the desired formula for 
g'z{x) for X G (—00, a). The calculations for a; G C)o) are completely analogous 
and therefore omitted. From our computations we can already infer, that H^'zHoo = 
^^^''f{z)^^^^ . So, it remains to show that this quantity is bounded in z G {a,b). Since 
it is a continuous function of z, we only have to show that it has finite limits on the 
edge of the interval (a, 6). But, of course, 

F{z){l-F{z) F(z) , 1 

lim ^ ^\ — — = lim -f-^ = lim M{z) 



z\a K^z) z\a I(z) z\a 7(a) 

4i^ = lim.^,iV(.) = -:J^. 



and, similarly, lim^^^ ^^^^Uz f^^^ ~ li^z/'b N{z) = — 7^. This concludes the proof. 



□ 

5.3. Proofs from Section [3l 

Proof of Proposition ISTB . Let us focus on the case z G (—1, 1), the cases z < —1 and 
z > 1 being similar and even easier. Of course, g^ is continuously differentiable on 
each of the intervals (—00, —1), (—1, z), {z, 1) and (1, 00) and from Proposition 12.221 
we know that gz is continuous on R since 2; 7^ ±1. By a result from calculus, to 
show that, for example, gz is on [—1, z], it is sufficient to prove that lim2;\_i g'zix) 
exists in R. Tot this end we recall from Remark I2.3UI that for — 1 < a; < 2; we have 
Using de l'Hôpital's rule (Theorem 5.1) and Lemma 5.6 (a) below, we obtain

(a + /3 + 2)F{x) 



(l - F{z)) lim 



x\-i (1 - x'^)p{x)[l3 -a - {a + 13 + A)x] 



l-F{z) (a + /3 + 2)F(x) 



2/3 + 4 

l-F{z) (a + /3 + 2)p(x 

lim 



2/3 + 4 x\-i p{x)[l3 - a - {a + 13 + 2)x] 
(1 - (g + /3 + 2) 

(2/3 + 4)(2/3 + 2) ■ 

Hence, Qz is continuously differentiable on [—1,2;]. That Qz is continuously differ- 
entiable on the intervals {—oo, 1], [z, 1] and [1, oo) can be proved similarly by using 
the representations for g'z{x) from Proposition 12.291 It also turns out, that the right 
and left hand derivatives of Qz at ±1 coincide, so that it is in fact differentiable at 
these points. 

So, we conclude, that Qz is continuous and piecewise continuously differentiable on 
M. Next, we show that Qz vanishes at infinity. This even applies to any bounded test 
function h, since e.g. for x > 1 we have 



\9hix)\ 



< \\h 



exp{-Fr{x)) J h{t)qr{t)dt 
j^qr{t)dt 

{x-iY+'^{i + xY+^ 



and by de I'Hopital's rule 



j^qr{t)dt ^ ^.^ j^{t-ini + tfdt 

™ {X - 1)°+1(1 + xY+^ {X - 1)"+1(1 + XY+^ 

lim (x-Vril+^> ^ 

(x - 1)"(1 + xY{{a + 1)(1 + x) + (/3 + l)(x - 1)) 



x^c» (a + 1)(1 + x) + (/3 + l)(x - 1) 

Hence, \\mx^oo gh{x) = 0. Similarly, one can prove that limx^-oo gh{x) = 0. 
It remains to show that 
\9'zix)\0c, (ii^)^^ = / \g',ix)\{l ~ x^)p{x)dx < oo .
-1 J-i 

To this end, it suffices to see that the function |5f!,(x)|(l — x'^)p{x) is bounded 
on (—1,2;) and on (-2,1). Since it is continuous on (— 1,^;] and on (-2,1) (where 
g'z{z) := lim.x/'z g'zi^) for definiteness) , this claim will follow if we have proved that 
lima;_j.±i g'^{x){l — x^)p{x) = 0. For x G (—1, z) we have 

g',{x){l-x')p(x)^{l-F(z)]^^ and 

x\-l 1 — x\-l —2x 

proving the claim for —1. Since it may be proved analogously for +1 the proof is 
complete. □ 

The following lemma will be useful for the proof of Proposition 3.4.

Lemma 5.6. The functions p, $q_l$, $q_r$ and $\eta$ satisfy the following equations for each
integer $k \ge 1$:

(a) $\frac{d}{dx} \bigl( \eta(x)^k p(x) \bigr) = \eta(x)^{k-1} p(x) \bigl[ (k-1) \eta'(x) + \gamma(x) \bigr]$ for each $x \in (-1, 1)$

(b) $\frac{d}{dx} \bigl( \eta(x)^k q_l(x) \bigr) = \eta(x)^{k-1} q_l(x) \bigl[ (k-1) \eta'(x) + \gamma(x) \bigr]$ for each $x \in (-\infty, -1)$

(c) $\frac{d}{dx} \bigl( \eta(x)^k q_r(x) \bigr) = \eta(x)^{k-1} q_r(x) \bigl[ (k-1) \eta'(x) + \gamma(x) \bigr]$ for each $x \in (1, \infty)$

Proof. First we prove (a). By (14) we have, multiplying by p(x),

\[ \eta(x) p'(x) = p(x) \bigl( \gamma(x) - \eta'(x) \bigr). \]

Hence, by the product rule we obtain

\[ \begin{aligned}
\frac{d}{dx} \bigl( \eta(x)^k p(x) \bigr) &= k \eta(x)^{k-1} \eta'(x) p(x) + \eta(x)^k p'(x) \\
&= \eta(x)^{k-1} \bigl[ k \eta'(x) p(x) + \eta(x) p'(x) \bigr] \\
&= \eta(x)^{k-1} p(x) \bigl[ k \eta'(x) + \gamma(x) - \eta'(x) \bigr] \\
&= \eta(x)^{k-1} p(x) \bigl[ (k-1) \eta'(x) + \gamma(x) \bigr],
\end{aligned} \]

proving (a). As to (b), we observe that by the definition of $q_l = \frac{\exp(F_l)}{\eta}$ we have on
the one hand

\[ \frac{d}{dx} \bigl( \eta(x) q_l(x) \bigr) = \frac{d}{dx} \exp(F_l(x)) = \exp(F_l(x)) \frac{\gamma(x)}{\eta(x)} = \gamma(x) q_l(x) \]

and on the other hand, by the product rule,

\[ \frac{d}{dx} \bigl( \eta(x) q_l(x) \bigr) = \eta'(x) q_l(x) + q_l'(x) \eta(x). \]

These two equations yield

\[ \eta(x) q_l'(x) = \gamma(x) q_l(x) - \eta'(x) q_l(x) = q_l(x) \bigl( \gamma(x) - \eta'(x) \bigr). \]

Now the proof follows the lines of the proof for p as above. Similarly one may prove (c). □
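Part (a) of the lemma can be checked numerically for the Beta family on (-1, 1). The sketch below rests on assumptions read off from the formulas in this section rather than from stated definitions: $\eta(x) = 1 - x^2$, $\gamma(x) = \beta - \alpha - (\alpha + \beta + 2)x$, and an unnormalised density $p(x) = (1-x)^{\alpha}(1+x)^{\beta}$. The derivative on the left hand side is approximated by a central difference, and the function names are hypothetical.

```python
def eta(x):
    return 1.0 - x * x


def eta_prime(x):
    return -2.0 * x


def gamma(x, al, be):
    return be - al - (al + be + 2.0) * x


def p(x, al, be):
    """Unnormalised Beta density on (-1, 1); the identity is linear in p."""
    return (1.0 - x) ** al * (1.0 + x) ** be


def lhs(x, k, al, be, h=1e-6):
    """Central difference approximation of d/dx (eta^k p)."""
    f = lambda t: eta(t) ** k * p(t, al, be)
    return (f(x + h) - f(x - h)) / (2.0 * h)


def rhs(x, k, al, be):
    """Right hand side eta^(k-1) p [(k-1) eta' + gamma]."""
    return eta(x) ** (k - 1) * p(x, al, be) * ((k - 1) * eta_prime(x) + gamma(x, al, be))
```

Under these assumptions one has $\eta p' = p(\gamma - \eta')$, which is the only input the proof of (a) uses, so `lhs` and `rhs` should agree up to discretisation error.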



Proof of Proposition 3.4. Assertion (a) immediately follows from Corollary 2.28 (a)
since in this case $c = \alpha + \beta + 2$. Now, we turn to the proof of (b). First, consider
$x \in (-1, 1)$. By Corollary 2.16 (b) we have for $x \in (-1, 1)$:



, , , 2||/i'|U H{x)G{x) 



a + /3 + 2 I{x)ri{x) 



For X G (-1, 1) let 



^^^^ ^ H{x)G{x) H{x)G{x) 



I{x)ri{x) ri{xyp{x) 

Then 5" is continuous on (—1, 1) and hence, to show that 5* is bounded on (—1, 1), it 
suffices to prove that S has finite limits at ±1. Since limj.\^_i G{x) = 7(— 1) = 2/3 + 2 
we obtain, using Lemma [5.61 and de I'Hopital's rule: 
lim S{x) = lim G{x) lim ^ ^



x\-i x\~i x\-i ri{xYp{x) 



^ (?7(a;)^p(x)) T]{x)p{x) [q'ix) + 7(x)] 

7(-l)(a + /3 + 2) ^.^ 



lim^\^_i [r/'(x) + 7(0;)] r]{x)p{x) 

7(-l)(« + /3 + 2) p{x) 



x] 



\im^\-i [-2x - {a + (3 + 2)x + (3 - a] x\-i j{x)p{ 
7(-l)(a + /3 + 2) 1 _ 7(-l)(a + /3 + 2) 1 



2/3 + 4 :r\-i7(x) 2/3 + 4 7(-l) 

a + (3 + 2 
2/3 + 4 



< 00 



Here we have used, that H'{x) = —'~f'{x)F{x) = (a + /3 + 2)F(x). Similarly one 
shows that 



lim S(x) = — ^ . 
x/'i ^ ^ 2a + 4 

Thus, we have shown that sup3,g(_]^^i-) 5(2;) < 00. Now, we consider x G (—00, — 1). 
From Corollary 12.281 (b) we have 



7(x) ( -xQi{x) + r tqi{t)dt 
\g'^{x)\<2\\h'\U ^ 



~r]{x)Ii{x 

For X G (—00, —1) we consider the function 



7(x) [-xQi{x) + j^tqi{t)dtj ^(2;) Q{t)dt 
' ' -ri{x)Ii{x) r]{xYqi{x) 

Clearly, 5"; is a continuous function on (— 00, — 1). To show that it is bounded, 
it thus suffices to prove that \im.xy^_iSi{x) < 00 and lim^^_oo 'S';(x) < 00. Using 
Lemma [5.61 and de I'Hopital's rule, we obtain 
rQ{t)dt

lim SAx) = lim 7(0;) lim " ,^ — -— 
x/^-i x/--\ X/--1 rj[xYqi[x) 

7(-l) lim 

rj{x)qi{x) \ri'{x) + 7(0;)] 

7(-l) Qi{x) 



lima:/._i [r]'{x) + 7(x)] ^/-i r]{x)qi{x) 

2/3 + 4 x/-! 7(x)gi(x) 2/3 + 4 a;/--! 7(x) 
1 



< 00 . 



2/3 + 4 

Next, we will show that lim2;^-oo Si{x) = 0. We actually have 
lim -(" + ^ + ^'^ + ^-"=0 

x^-oo r]{x) X^-OQ 1 — X^ 

and by de I'Hopital's rule, also 

Here we have used that ri{x)qi{x) = —(1 — 1 — x)'^^^ — )■ —00 as a; — )■ — cxd. 

Hence, \imx^_oo Si{x) = and sup^g(-_oo ^/(x) < 00. Since we can show in a 
similar manner that sup^.^^^^ Sr{x) < 00, where 5*^ is defined in the obvious way, 
the proof is complete. □ 

Proof of Lemma \3. 51 If E[u{Zy\ = 0, then the usual Stein solution is bounded on 
(— 1, 1) by Proposition 13.31 For the converse, let us assume that E[u{Z)] 7^ 0. As was 
already noted in Section [21 the solutions of the homogeneous equation corresponding 
to (HUj) are exactly the multiples of ^. Thus, every solution / of ( HOl) has the form 

r_^u{t)p{t)dt+c 

f{x) = f \ f \ ' a;e -1,1 , 

ri[x)p{x) 

for some c G M. If c = 0, then 

hm / [X) = lim r^r~r^ — = ="=00 , 

x/'i x/-\ r]\x)p\x) 

since E\u{Zy\ 7^ 0. Hence, / is unbounded near 1. If c 7^ we have 
lim f(x) — lim = ±00

a:\-l x\-l r)[x)p[x) 



So, / is unbounded near —1. Hence, in any case / is unbounded on (—1,1). 



□ 